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TRANSPORTERS AND ION CHANNELS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of transporters and ion 
5 channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport, 
neurological, muscle, immunological, and cell proliferative disorders, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of 
transporters and ion channels. 

10 BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells and 
organelles require transport proteins to import and export essential nutrients and metal ions including 
K + , NH^, S0 4 ^, sugars, and vitamins, as well as various metabolic waste products. Transport 

15 proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic neurotransmission, 
kidney function, intestinal absorption, tumor growth, and other diverse cell functions (Griffith, J. and C. 
Sansom (1998) The Transporter Facts Boot , Academic Press, San Diego CA, pp. 3-29). Transport 
can occur by a passive concentration-dependent mechanism, or can be linked to an energy source 
such as ATP hydrolysis or an ion gradient Proteins that function in transport include carrier proteins, 

20 which bind to a specific solute and undergo a conformational change that translocates the bound solute 
across the membrane, and channel proteins, which form hydrophilic pores that allow specific solutes to 
diffuse through the membrane down an electrochemical solute gradient 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 

25 simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 
opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it The 
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 

30 Na + /K + ATPase system. Sodium-coupled transporters include the mammalian glucose transporter 
(SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have 
twelve putative transmembrane segments, extracellular glycosylation sites, and cytoplasmicaHy- 
oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of 
various thyroid pathologies because it is the molecular basis for radi&odide thyroid-imaging techniques 
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and for specific targeting of radioisotopes to the thyroid gland (Levy, O. et aL (1997) Proc. Nail. 
Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and placenta, 
and is implicated in the transport of the water-soluble vitamins, e.g., bi tin and pantothenate (Prasad, 
P.D. et aL (1998) J. Biol. Chem. 273:7501-7506). 
5 One of the largest families of transporters is the major facilitator superfamily (MPS), also 

called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide earners that 
transport small solutes in response to ion gradients. Members of the MFS are found in all classes of 
living organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates, 
nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaiyotes all have a structure 

10 comprising 12 transmembrane segments (Pao, S.S. et aL (1998) Microbiol. Molec. BioL Rev. 62:1-34). 
The largest family of MFS transporters is the sugar transporter family, which includes the seven 
ghicose transporters (GLUT1-GLUT7) found in humans that are required for the transport of glucose 
and other hexose sugars. These glucose transport proteins have unique tissue distributions and 
physiological functions. GLUT1 provides many cell types with their basal ghicose requirements and 

15 transports glucose across epithelial and endothelial barrier tissues; GLUT2 facilitates glucose uptake 
or efflux from the liver; GLUT3 regulates ghicose supply to neurons; GLUT4 is responsible for insulin- 
regulated ghicose disposal; and GLUT5 regulates fructose uptake into skeletal muscle. Defects in 
ghicose transporters are involved in a recently identified neurological syndrome causing infantile 
seizures and developmental delay, as well as glycogen storage disease, Fanconi-Bickel syndrome, and 

20 non-insulin-dependent diabetes mellitus (Mueckler, M. (1994) Eur. J. Biochem. 219:713-725; Longo, 
N. and L.J. Elsas (1998) Adv. Pediatr. 45293-313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes Lrlactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforros have been identified to date. The isoforms are 

25 predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop between 
TM6 and TM7, and play a critical role in maintaining intracellular pHby removing the protons that are 
produced stoichiometrically with lactate during glycolysis. The best characterized 
H + -monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and a 
wide range of other aliphatic monocarboxylates. Other cells possess HMinked monocarboxylate 

30 transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and tumor 
cells have transporters that differ in their values for certain substrates, including stereoselectivity 
for L- over D-lactate, and in their sensitivity to inhibitors. There are Na + -monocarboxylate 
cotransporters on the luminal surface of intestinal and kidney epithelia, which allow the uptake of 
lactate, pyruvate, and ketone bodies in these tissues, hi addition, there are specific and selective 
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transporters for organic cations and organic anions in organs including the kidn y, intestine and liver. 
Organic anion transporters are selective for hydrophobic, charged molecules with electron-attracting 
side groups. Organic cation transporters, such as the ammonium transporter, mediate the secretion of 
a variety of drugs and endogenous metabolites, and contribute to the maintenance of intercellular pH 
5 (Poole, R.C. and A.P. Halestrap (1993) Am. J. Physiol. 264:C761-C782; Price, N.T. et al. (1998) 
Biochem. J. 329:321-328; and Martinelle, K. and I. Haggstrom (1993) J. BiotechnoL 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane 
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids, 
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC 

10 transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to 
supply the energy required for transport, and two membrane-spanning domains (MSD), each 
containing six putative transmembrane segments. These four modules may be encoded by a single 
gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. 
When encoded by separate genes, each gene product contains a single NBD and MSD. These "half- 

15 molecules" form homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based 
major histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to 
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic 
fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), 
Zellweger syndrome (peroxisomal membrane protein-70, FMP70), and hyperinsulinemic hypoglycemia 

20 (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 
ABC transporter, in human cancer cells makes die cells resistant to a variety of cytotoxic drags used 
in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Bnzymol 292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, selenium, 
nickel, and chromium are important as cof actors for a number of enzymes. For example, copper is 

25 involved in hemoglobin synthesis, connective tissue metabolism, and bone development, by acting as a 
cofactor in oxidoreductases such as superoxide dismutase, ferroxidase (ceruloplasmin) , and lysyl 
oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by transporters in 
the gastrointestinal tract Plasma proteins transport the metal ions to the liver and other target organs, 
where specific transporters move the ions into cells and cellular organelles as needed. Imbalances in 

30 metal ion metabolism have been associated with a number of disease states (Dahks, D.M. (1986) J. 
Med. Genet 23:99-106). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 
acid transport appears to occur via a high affinity, lo w capacity protein-mediated transport process. 
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Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane 
segments, is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as 
muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose 
conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et 
5 aL (1998) J. Biol. Chem. 273:27420-27429). 

Mitochondrial carrier proteins are txansmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, 
ATP carrier protein; the 2-oxoghitarate/malate carrier; the phosphate carrier protein; the pyruvate 
carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the 

10 tricafboxylate carrier which transports citrate and malate; and the Grave's disease carrier protein, a 
protein recognized by IgG in patients with active Grave's disease, an autoimmune disorder resulting in 
hyperthyroidism. Proteins in this family consist of three tandem repeats of an approximately 100 
amino acid domain, each of which contains two transmembrane regions (Stryer, L. (1995) 
Biochemistry . W.BL Freeman and Company, New York NY, p. 551; PROSITE PDOC00189 

15 Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in Man (OMIM) 
*275000 Graves Disease). 

. This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation from 
ATP synthesis. The result is energy dissipation in the form of heat Mitochondrial uncoupling proteins 

20 have been implicated as modulators of thermoregulation and metabolic rate, and have been proposed 
as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et aL (1999) J. 
Int Med. 245:637-642). 
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
25 ions across the plasma membrane. The movement of ions requires ion channels, which form ion- 
selective pores within the membrane. There are two basic types of ion channels, ion transporters and 
gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient Gated ion channels allow passive flow of an 
ion down the ion's electrochemical gradient under restricted conditions. Together, these types of ion 
30 channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse 
conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration 
gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell Utilizing the 
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energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient 
These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca 2+ -ATPase, and H + -ATPase, are activated by a 
phosphorylation event P-class ion transporters are responsible for maintaining resting potential 
5 distributions such that cytosolic concentrations of Na + and Ca 2+ are low and cytosolic concentration of 
K + is high. The vacuolar (V) class of ion transporters includes H + pumps on intracellular organelles, 
such as lysosomes and GolgL V-class ion transporters are responsible for generating the low pH 
within the lumen of these organelles that is required for function. The coupling factor (F) class 
consists of H + pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate 

10 ATP from ADP and inorganic phosphate (Pj). 

The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and 
several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) Curr. 
Opia CellBioL 11:517-522). The V-ATPases are composed of two functional domains: the V! 
domain, a peripheral complex responsible for ATP hydrolysis; and the V 0 domain, an integral complex 

15 responsible for proton translocation across the membrane. The F-ATPases are structurally and 
evohitionarily related to the V-ATPases. The F-ATPase F 0 domain contains 12 copies of the c 
subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a single 
buried carboxyl group in TM2 that is essential for proton transport The V-ATPase V 0 domain 
contains three types of homologous c subunits with four or five transmembrane domains and the 

20 essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a subunit that 
may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. BioL Chem. 
274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 
25 the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 

(symport) so that the movement of Na + down an electrochemical gradient drives transport of the other 
molecule up a concentration gradient Similarly, cardiac muscle links transfer of Ca 2 * out of the cell 
with transport of Na + into the cell (antiport). 
Gated Ion Channels 

30 Gated ion channels control ion flow by regulating the opening and closing of pores. The ability 

to control ion flux through various gating mechanisms allows ion channels to mediate such diverse 
signaling and homeostatic functions as neuronal and endocrine signaling, muscle contraction, 
fertilization, and regulation of ion and pH balance. Gated ion channels are categorized according to 
the manner of regulating the gating function. Mechanically-gated chann Is open their pores in 
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response to mechanical stress; voltage-gated channels (e.g., Na + , K + , Ca 2 *, and Cl" channels) open 
their pores in response to changes in membrane potential; and ligand-gated channels (e.g., 
acetylcholine-, serotonin-, and ghitamate-gated cation channels, and GABA- and glycine-gated 
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. 

5 The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and 
closing) are sometimes modulated by association with auxiliary channel proteins and/or post 
translational modifications, such as phosphorylation. 

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 

10 contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently cloned 
from rat kidney. The SIC channel belongs to a group of channels which are activated by pressure or 
stress on the cell membrane and conduct both Ca 2+ and Na + (Suzuki, M. et aL (1999) J. Biol. Chem. 
274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 

15 channel proteins. The characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and 
carboxy termini. In the Na + and Ca 2 * subfamilies, this domain is repeated four times, while in the K + 
channel subfamily, each channel is formed from a tetramer of either identical or dissimilar subunits. 
The P region contains information specifying the ion selectivity for the channel In the case of K* 

20 channels, a GYG tripeptide is involved in this selectivity (Ishii, TM. et aL (1997) Proc. NatL Acad. 
Sci. USA 94:11651-11656). 

Voltage-gated Na + and K + channels are necessary for the function of electrically excitahle 
cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and 
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na + 

25 and K + ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na + 
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more 
voltage-gated Na + channels, which propagates the depolarization down the length of the cell. 
Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow 
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged 

30 residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts 
only ahout 1 millisecond, at which time the channel spontaneously converts into an inactive state that 
cannot be opened irrespective of the membrane potential. Saactivation is mediated by the channel's 
N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a closed state 
requires a return to resting potential 
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Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore- 
forming a subunit that associates with two smaller auxiliary subunits, 01 and |}2. The 02 subunit is a 
integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a and 
01 subunits correlates with increased functional expression of the channel, a change in its gating 
5 properties, as well as an increase in whole cell capacitance due to an increase in membrane surface 
area (bom, L.L. et aL (1995) Cell 83:433-442). 

Non voltage-gated Na + channels include the members of the amiloride-sensitive Na + 
channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two 
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini located 

10 within the eel The NaC/DEG family includes the epithelial Na + channel (ENaC) involved in Na + 
reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the kidney, and 
exocrine duct glands. Mutations in ENaC result in pseudohypo aldosteronism type 1 and liddle's 
syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently characterized 
H + -gated cation channels or acid-sensing ion channels (ASIC). ASIC subunits are expressed in the 

15 brain and form heteromultimeric Na + -penneable channels. These channels require acid pH 

fluctuations for activation. ASIC subunits show homology to the degenerins, a family of mechanically- 
gated channels originally isolated from C. elegans. Mutations in the degenerins cause 
neurodegeneratioiL ASIC subunits may also have a role in neuronal function, or in pain perception, 
since tissue acidosis causes pain (Waldmann, IL and M. Lazdunski (1998) Curr. Opin. NeurobioL 

20 8:418-424; Eglen, R.M. et aL (1999) Trends Pharmacol Sci. 20:337-342). 

K + channels are located in all cell types, and may be regulated by voltage, ATP concentration, 
or second messengers such as Ca 2+ and cAMP. In non-excitable tissue, K + channels are involved in 
protein synthesis, control of endocrine secretions, and the maintenance of osmotic equilibrium across 
membranes. In neurons and other excitable cells, in addition to regulating action potentials and 

25 repolarizing membranes, K + channels are responsible for setting resting membrane potential The 
cytosol contains non-diffusible anions and, to balance this net negative charge, the cell contains a Na + - 
K + pump and ion channels that provide the redistribution of Na + , K + , and Or. The pump actively 
transports Na + out of the cell and K + into the cell in a 3 :2 ratio. Ion channels in the plasma membrane 
allow K + and CI " to flow by passive diffusion. Because of the high negative charge within the cytosol, 

30 CI" flows out of the cell The flow of K + is balanced by an electromotive force pulling K + into the cell, 
and a K + concentration gradient pushing K + out of the eel Thus, the resting membrane potential is 
primarily regulated by K + flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489^92). 

Potassium channel subunits of the Shaker -like superfamily all have the characteristic six 
transmembrane/! pore domain structure. Four subunits combine as homo- or heterotetramers to form 



7 



WO 02/22684 



PCT/US01/28938 



functional K channels. These pore-forming subunits also associate with various cytoplasmic P 
subunits that alter channel inactivation kinetics. The Shaker -like channel family includes the voltage- 
gated K + channels as well as the d layed rectifier type channels such as the human ether-a-go-go 
related gene (HERG) associated with long QT, a cardiac dysrythmia syndrome (Curran, M.E. (1998) 
5 Curr. Opin. BiotechnoL 9:565-572; Kaczorowski, G J. and MX. Garcia (1999) Curr. Opin. Chem. 
Biol. 3:448-458). 

A second superfamily of K + channels is composed of the inward rectifying channels (Kir). 
Kir channels have the property of preferentially conducting K + currents in the inward direction. These 
proteins consist of a single potassium selective pore domain and two transmembrane domains, which 

10 correspond to the fifth and sixth transmembrane domains of voltage-gated K + channels. Kir subunits 
also associate as tetramers. The Kir family includes ROMK1, mutations in which lead to Bartter 
syndrome, a renal tubular disorder. Kir channels are also involved in regulation of cardiac pacemaker 
activity, seizures and epilepsy, and insulin regulation (Doupnik, C.A. et aL (1995) Curr. Opin. 
NeurobioL 5:268-277; Curran, supra) . 

15 The recently recognized TWIK K + channel family includes the mammalian TWIK-1, TREK-1 

and TASK proteins. Members of this family possess an overall structure with four transmembrane 
domains and two P domains. These proteins are probably involved in controlling the resting potential 
in a large set of cell types (Duprat, R et al (1997) EMBO J 16:5464-5471). 

The voltage-gated Ca 2 * channels have been classified into several subtypes based upon their 

20 electrophysiological and pharmacological characteristics. L-type Ca 2+ channels are predominantly 
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction 
coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type 
channels are involved in the control of neurotransmitter release in the central and peripheral nervous 
system. The L-type and N-type voltage-gated Ca 2+ channels have been purified and, though their 

25 functions differ dramatically, they have similar subunit compositions. The channels are composed of 
three subunits. The 04 subunit forms the membrane pore and voltage sensor, while the afi and f) 
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the channel 
These subunits are encoded by at least six 04, one 0^5, and four P genes. A fourth subunit, y 9 has 
been identified in skeletal muscle (Walker, D. et aL (1998) J. BioL Chem. 273:2361-2367; Mcdeskey, 

30 E.W. (1994) Curr. Opin. NeurobioL 4:304-312). 

The transient receptor family (Trp) of calcium ion channels are thought to mediate 
capacitative calcium entry (CCE). CCE is the Ca 2 * influx into cells to resupply Ca 2 * stores depleted 
by the action of inositol triphosphate (IP3) and other agents in response to numerous hormones and 
growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to voltage 
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gated Ca2+ channels in the S3 through S6 regions. This suggests that Tip and/or related proteins may 
form mammalian CCC entry channels (Zhu, X. et aL (1996) Cell 85:661-671; Boulay, G. et al. (1997) 
J. Biol. Chem. 272:29672-29680). Melastatin is a gene isolated in both the mouse and human, and 
whose expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo. 
5 The human cDNA transcript corresponds to a 1533-amino acid protein having homology to members 
of the Trp family. It has been proposed that the combined use of malastatin mRNA expression status 
and tumor thickness might allow for the determination of subgroups of patients at both low and high 
risk for developing metastatic disease (Duncan, KM, et al (2001) J. Clin. Oncol. 19:568-576). 

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 

10 organelle pH. In secretory epithelial cells, CI" enters the cell across a basolateral membrane through 
an Na + , K7CT cotransporter, accumulating in the cell above its electrochemical equilibrium 
concentration. Secretion of CI" from the apical surface, in response to hormonal stimulation, leads to 
flow of Na + and water into the secretory lumen. The cystic fibrosis transmembrane conductance 
regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic 

15 disorder in humans. CFTR is a member of the ABC transporter family, and is composed of two 

domains each consisting of six transmembrane domains followed by a nucleotide-binding site. Loss of 
CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus that coat 
the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The resulting 
blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and devastating "chronic 

20 obstructive pulmonary disease" (Al-Awqati, Q. et aL (1992) J. Exp. BioL 172:245-266). 

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 
domains, as well as two small globular domains known as CBS domains. The CLC subunits probably 
function as homotetramers. CLC proteins are involved in regulation of cell volume, membrane 
potential stabilization, signal transduction, and transepithelial transport Mutations in CLC-1, expressed 

25 predominantly in skeletal muscle, are responsible for autosomal recessive generalized myotonia and 
autosomal dominant myotonia congenita, while mutations in the kidney channel CLC-5 lead to kidney 
stones (Jentsch, TJ. (1996) Curr. Opin. NeurobioL 6:303-310). 

ligand-gated channels open their pores when an extracellular or intracellular mediator binds to 
the channeL Neurotransmitter-gated channels are channels that open when a neurotransmitter binds 

30 to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or muscle 
cells. There are two types of neurotransmitter-gated channels. Sodium channels open in response to 
excitatory neurotransmitters, such as acetylcholine, ghitamate, and serotonin. This opening causes an 
influx of Na + and produces the initial localized depolarization that activates the voltage-gated channels 
and starts th action potential. Chloride channels open in response to inhibitory neurotransmitters, 

9 



WO 02/22684 



PCT/US01/28938 



such as y-aminobutyric acid (GABA) and glycine, leading to hyperpolarization of th membrane and 
the subsequent generation of an action potential. Neurotransmitter-gated ion channels have four 
transmembrane domains and probably function as pentamers (Jentsch, supra). Amino acids in the 
second transmembrane domain appear to be important in determining channel permeation and 
5 selectivity (Sather, W. A. et aL (1994) Curr. Opin. NeurobioL 4:3 13-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example, 
calcium-activated K + channels are gated by internal calcium ions. In nerve cells, an infhx of calcium 
during depolarization opens K + channels to modulate the magnitude of the action potential (Ishi et aL, 
supra) . The large conductance (BK) channel has been purified from brain and its subunit composition 

10 determined. The a subunit of the BK channel has seven rather than six transmembrane domains in 
contrast to voltage-gated K + channels. The extra transmembrane domain is located at the subunit N- 
teiminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the "calcium bowl" region) 
contains many negatively charged residues and is thought to be the region responsible for calcium 
binding. The P subunit consists of two transmembrane domains connected by a glycosylated 

15 extracellular loop, with intracellular N- and C-termini (KaczorowsM, supra ; Vergara, C. et aL (1998) 
Cuit. Opin. NeurobioL 8:321-329). 

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best 
examples of these are the cAMP-gated Na + channels involved in olfaction and the cGMP-gated 
cation channels involved in vision. Both systems involve ligand-mediated activation of a G-protein 

20 coupled receptor which then alters the level of cyclic nucleotide within the celL CNG channels also 
represent a major pathway for Ca 2 * entry into neurons, and play roles in neuronal development and 
plasticity. CNG channels are tetramers containing at least two types of subunits, an a subunit which 
can form functional homomeric channels, and a P subunit, which modulates the channel properties. 
All CNG subunits have six transmembrane domains and a pore forming region between the fifth and 

25 sixth transmembrane domains, similar to voltage-gated K + channels. A large C-terminal domain 
contains a cyclic nucleotide binding domain, while the N-terminal domain confers variation among 
channel subtypes (Zufoll, F. et aL (1997) Curr. Opin. NeurobioL 7:404-412). 

The activity of other types of ion channel proteins may also be modulated by a variety of 
intracellular signalling proteins. Many channels have sites for phosphorylation by one or more protein 

30 kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all of which 
regulate ion channel activity in cells. Kir channels are activated by the binding of the GPy subunits of 
heterotrimeric G-proteins (Reimann, F. and RM. Ashcroft (1999) Curr. Opin. CelL BioL 11:503-508). 
Other proteins are involved in the localization of ion channels to specific sites in the cell membrane. 
Such proteins include the PDZ domain proteins known as MAGUKs (membrane-associated guanylate 
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kinases) which regulate the clustering of ion channels at neuronal synapses (Craven, S.E. and D.S. 
Bredt (1998) Cell 93:495-498). 
Disease Correlation 

The etiology of numerous human diseases and disorders can he attributed to defects in the 
5 transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g., cystic fibrosis, ghicose-galactose 
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease (vant Hoff, 

10 W.G. (1996) Exp. Nephrol 4:253-262; Talente, G.M. et aL (1994) Ann. Intern. Med. 120:218-226; 
and Chfflon, M. et al (1995) New EngL J. Med. 332:1475-1480). 

Human diseases caused by mutations in ion channel genes include disorders of skeletal 
muscle, cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of 
sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary 

15 contraction is delayed. Sodium channel myotonias have been treated with channel blockers. 

Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while mutations in 
the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle sodium channel 
cause malignant hyperthermia. Cardiac airythmia disorders such as the long QT syndromes and 
idiopathic ventricular fibrillation are caused by mutations in potassium and sodium channels (Cooper, 

20 E.C and L. Y. Jan (1998) Proc. Nad. Acad. ScL USA 96:4759-4766). All four known human 

idiopathic epilepsy genes code for ion channel proteins (Berkovic, S.F. and LE. Scheffer (1999) Curr. 
Opin. Neurology 12:177-182). Other neurological disorders such as ataxias, hemiplegic migraine and 
hereditary deafness can also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin. 
NeurobioL 9:274-280; Cooper, supra) . 

25 Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 

have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 
and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol 39:47-98). 
Various classes of ion channels also play an important role in the perception of pain, and thus are 

30 potential targets for new analgesics. These include the vanffloid-gated ion channels, which are 
activated by the vaniHoid capsaicin, as well as by noxious heat Local anesthetics such as lidocaine 
and mexiletine which blockade voltage-gated Na + channels have been useful in the treatment of 
neuropathic pain (Egjen, supra). 

Ion channels in the immune system have recently been suggested as targets for 
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inununomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell 
specific ion channels has been characterized that affect this signaling process. Channel blocking 
agents can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide 
antagonist of the T-cell potassium channel Kvl.3 was found to suppress delayed-type hypersensitivity 
5 and allogenic responses in pigs, validating the idea of channel blockers as safe and efficacious 
immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 8:749-756). 

The discovery of new transporters and ion channels, and the polynucleotides encoding them, 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 
and treatment of transport, neurological, muscle, immunological, and cell proliferative disorders, and in 
10 the assessment of the effects of exogenous compounds on the expression of nucleic acid and amino 
acid sequences of transporters and ion channels. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to 

15 collectively as "TRICIT and individually as "TRICH-1" *TRIGH-2, w< TRICH-3," "IKIC&4," 
'TRICH-5," 'TRICH-6/' "TRICH-7" "TRICH-8," "TRICH-9," "TRIGH-10," "TRICE-ll," 
"TRICH-12," *TRICH~13," C TRICH-14," "TRICH-15" 4 TRICH-16," "TRICH-17" "TRICH-18," 
"TRIGH-19," ''TRICH-20," 'TRICH-21," c TRICH-22," *TRICH-23," 'TRICH-24," 'TRICH-25," 
and "TRICH-26/' In one aspect, the invention provides an isolated polypeptide selected from the 

20 group consisting of a) a polypeptide comprising an amino acid sequence selected from die group 

consisting of SEQ ID NO: 1-26, b) a polypeptide comprising a naturally occurring amino acid sequence 
at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
26, c) a biologically active fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-26, and d) an immunogenic fragment of a polypeptide having an 

25 amino acid sequence selected from the group consisting of SEQ ID NO: 1-26. In one alternative, the 
invention provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 1-26. 

Hie invention further provides an isolated polynucleotide encoding a polypeptide selected from 
the 1 group consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-26, b) a polypeptide comprising a naturally occurring amino acid sequence 

30 at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1- 
26, c) a biologically active fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-26, and d) an immunogenic fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-26. In ne alternative, the 
polynucleotide encodes a polypeptide selected from the group consisting of SEQ ID NO: 1-26. In 
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another alternative, the polynucleotide is selected from the group consisting of SEQ ID NO:27-52. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
5 of SEQ ID NO: 1-26, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-26, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-26, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO:l~26. In one alternative, the 

10 invention provides a cell transformed with the recombinant polynucleotide. In another alternative, the 
invention provides a transgenic organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-26, b) a polypeptide comprising a naturally occurring amino acid sequence at least 

15 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:l-26, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-26, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-26. The method comprises a) 
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is 

20 transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-26, b) a polypeptide comprising a naturally 

25 occurring amino acid sequence at least 90% identical to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-26, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-26, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
IDNO:l-26. 

30 The invention further provides an isolated polynucleotide selected from the group consisting of 

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:27-52, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID N027-52, 
c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to 
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the polynucleotide of b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide 
comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting of 
5 a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:27-52, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:27-52, 
c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to 
the polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the 

10 sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 

complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 
said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 

15 comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide having a sequence of a polynucleotide selected from the group consisting of a) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:27-52, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 

20 identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:27-52, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said 
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, 

25 optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a polypeptide 
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-26, b) a polypeptide comprising a naturally occurring 
amino acid sequence at least 90% identical to an amino acid sequence selected from the group 

30 consisting of SEQ ID NO:l-26, c) a biologically active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-26, and d) an immunogenic fragment of 
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-26, 
and a pharmaceutically acceptable excipient In one embodiment, the composition comprises an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-26. The invention additionally 



14 



WO 02/22684 



PCT/US01/28938 



provides a method of treating a disease or condition associated with decreased expression of 
functional TRIOS, comprising administering to a patient in need of such treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
5 acid sequence selected from the group consisting of SEQ ID NO:l-26, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from die group consisting of SEQ ID NO:l-26, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-26, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 

10 consisting of SEQ ID NO:l-26. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the 
invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutical^ acceptable excipient In another alternative, the invention provides a method of 
treating a disease or condition associated with decreased expression of functional TRICH, comprising 

15 administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-26, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid 

20 sequence selected from the group consisting of SEQ ID NO:l-26, c) a biologically active fragment of 
a polypeptide having an amino acid sequence selected from die group consisting of SEQ ID NO:l-26, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-26. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting antagonist activity in the sample, hi one alternative, the 

25 invention provides a composition comprising an antagonist compound identified by the method and a 
pharmaceuticaHy acceptable excipient In another alternative, the invention provides a method of 
treating a disease or condition associated with overexpression of functional TRICH, comprising 
administering to a patient in need of such treatment the composition. 

The invention further provides a method of screening for a compound that specifically binds to 

30 a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-26, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-26, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-26, and d) an 
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immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-26. Hie method comprises a) combining the polypeptide with at least one 
test compound under suitable conditions, and b) detecting binding of the polypeptide to the test 
compound, thereby identifying a compound that specifically binds to the polypeptide. 

5 The invention further provides a method of screening for a compound that modulates the 

activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-26, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-26, c) a biologically active fragment of a polypeptide 

10 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-26, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-26. The method comprises a) combining the polypeptide with at least one 
test compound under conditions permissive for the activity of the polypeptide, b) assessing the activity 
of the polypeptide in the presence of the test compound, and c) comparing the activity of the 

15 polypeptide in the presence of the test compound with the activity of the polypeptide in the absence of 
the test compound, wherein a change in the activity of the polypeptide in the presence of the test 
compound is indicative of a compound that modulates the activity of tie polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 

20 polynucleotide sequence selected from the group consisting of SEQ ID NO:27-52, the method 

comprising a) exposing a sample comprising the target polynucleotide to a compound, and b) detecting 
altered expression of the target polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; b) 

25 hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:27-52, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:27-52, iii) a polynucleotide 

30 having a sequence complementary to i), iv) a polynucleotide complementary to the polynucleotide of 
ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotid selected from the group consisting of i) a polynucleotide comprising 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:27-52, ii) a 
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polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SBQ ID NO:27-52, iii) a polynucleotid 
complementary to the polynucleotide of i), iv) a polynucleotide complementary to the polynucleotide of 
ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment 
5 of a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amount of hybridization complex in the treated 
biological sample with the amount of hybridization complex in an untreated biological sample, wherein 
a difference in the amount of hybridization complex in the treated biological sample is indicative of 
toxicity of the test compound. 

10 

BRIEF DESCRIPTION OP THE TABLES 
Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the present invention* 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
15 homolog for polypeptides of the invention. The probability score for the match between each 
polypeptide and its GenBank homolog is also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and domains, along with the methods, algorithms, and searchable databases used for 
analysis of the polypeptides. 
20 Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 

polynucleotide sequences of the invention, along with selected fragments of the polynucleotide 
sequences. 

Table 5 shows the representative cDNA library for polynucleotides of the invention. 
Table 6 provides an appendix which describes the tissues and vectors used for construction of 
25 the cDNA libraries shown in Table 5. 

Table 7 shows the took, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

30 Before the present proteins, nucleotide sequences, and methods are described, it is understood 

that this invention is not limited to the particular machines, materials and methods described, as these 
ma/vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 
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It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host ceQ" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
5 forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
10 described. All publications mentioned herein are cited for the purpose of describing and disclosing the 
cell lines, protocols, reagents and vectors which are reported in the publications and which might be 
used in connection with the invention Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

15 "TRIQET refers to the amino acid sequences of substantially purified TRICK obtained from 

any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
TRICK Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 

20 compound or composition which modulates the activity of TRICH either by directly interacting with 
TRICH or by acting on components of the biological pathway in which TRICH participates. 

An "allelic variant?' is an alternative form of the gene encoding TRICH. Allelic variants may 
result from at least one mutation in the nu cleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 

25 many allelic variants of its naturally occurring form Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence: 

"Altered'* nucleic acid sequences encoding TRICH include those sequences with deletions, 
30 insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or a 
polypeptide with at least one functional characteristic of TRICH Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding TRICH. 
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The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of 
amino acid residues which produce a silent change and result in a functionally equivalent TRICH 
D liberate amino acid substitutions may be mad on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological 
5 or immunological activity of TRICH is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 
include: asparagine and ghitamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 

10 and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 

15 to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PGR) technologies well known 
in the art 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
20 of TRICH. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of TRICH either by 
directly interacting with TRICH or by acting on components of the biological pathway in which 
TRICH participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
25 thereof, such as Fab, F(ab , ) 2 , and Fv fragments, which are capable of binding an epitopic determinant 
Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, 
or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used 
30 carriers that are chemically coupled to peptides include bovine serum albumin, thyreoglobulin, and 
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinanf ' refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
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which bind specifically to antigenic determinants (particular regions or three-dimensional structures on 
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used 
to licit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
5 specific molecular target Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. 
5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions maybe double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. The 

10 nucleotide components of an aptamer may have modified sugar groups (e.g., the 2-OH group of a 
ribonucleotide may be replaced by 2-F or 2 , -NH 2 ), which may improve a desired property, e.g., 
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 

15 cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13.) 

The term "intramer" refers to an aptamer which is expressed in vivo . For example, a vaccinia 
virus-based RNA expression system has been used to express specific RNA aptamers at high levels 
in the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl Acad. SdL USA 96:3606-3610). 
The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 

20 handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which act on right-handed 
nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 

25 peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 

phosphorothioates, mefhylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2-methoxyethyl sugars or 2 -methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2 , -deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 

30 introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
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functions of a naturally occurring molecule. Likewise, "immunologically active" or 'Immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TRICH, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

5 "Complementary" describes the relationship between two single-stranded nucleic acid 

sequences that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 
3-TCA-5'. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 

10 amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 

Compositions comprising polynucleotide sequences encoding TRICH or fragments of TRICH may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 

15 SDS), and other components (e.g., Denhaidt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to 
repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 

20 program for fragment assembly, such as the GELVDEW fragment assembly system (GCG, Madison 
WI) or Ehrap (University of Washington, Seattle WA). Some sequences have been both extended 
and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 

25 protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 

Original Residue Conservative Substitution 

Ala Gly,Ser 

30 Arg BQs,Lys 

Asn Asp, Gin, His 

Asp Asn, Ghi 

Cys Ala, Ser 

Gin Asn, Ghi, IBs 

35 Ghi Asp, Gin, His 

Gly Ala 

His Asn, Arg, Gin, Gh 
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lie 

Leu 

Lys 

Met 

Phe 

Ser 

Thr 

Tip 

Tyr 

Val 



Leu, Val 
Be, Val 



5 



Arg, Gin, Ghx 
Leu, lie 

His, Met, Leu, Tip, Tyr 



Cys,Thr 
Ser, Val 
Phe, Tyr 



10 



ESs,Fhe,Trp 
He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
15 the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
20 alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 



25 measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

'Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

30 "Exon shuffling" refers to the recombination of different coding regions (exons). Since an 

exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 



35 identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 



A "detectable labeF refers to a reporter molecule or enzyme that is capable of generating a 



A "fragment" is a unique portion of TRICK or the polynucleotide encoding TRICH which is 
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used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides r amino acid 
residues in length. Fragments may be preferentially selected from certain regions. of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
5 from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence listing, tables, and figures, maybe encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:27-52 comprises a region of unique polynucleotide sequence that 
10 specifically identifies SEQ ID NO:27-52, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SBQ ID NO:27-52 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish SEQ 
ID NO:27-52 from related polynucleotide sequences. The precise length of a fragment of SEQ ID 
NO:27-52 and the region of SEQ ID NO:27-52 to which the fragment corresponds are routinely 
15 determinable by one of ordinary skill in the art based on the intended purpose for the fragment 

A fragment of SEQ ID NO:l-26 is encoded by a fragment of SEQ ID NO:27-52. A 
fragment of SEQ ID NO:l-26 comprises a region of unique amino acid sequence that specifically 
identifies SEQ ID NO:l-26. For example, a fragment of SEQ ID NO.1-26 is useful as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO: 1-26. 
20 The precise length of a fragment of SEQ ID NO:l-26 and the region of SEQ ID NO:l-26 to which 
the fragment corresponds are routinely determinable by one of ordinary skill in the art based on the 
intended purpose for the fragment 

A "full length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codon. A '*full 
25 length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms '^percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
30 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
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sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Mggins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in H&ggins, D.G. et al. (1992) CABIOS 
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
5 follows: Ktuple=2, gap penalty^, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default Percent identity is reported by CLUSTAL V as the tc percent , 
similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
10 Search Tool (BLAST) (Altschul, S.F. et aL (1990) J. MoL BioL 215:403-410), which is available from 
several sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbLnhn-nih.gov/BLASTA The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called ''BLAST 2 
15 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. 'BLAST 2 

Sequences" can be accessed and used interactively at http://www.ncbi.nlmnih.gov/go The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
20 2.0. 12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 
25 Gap x drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
30 as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
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length over which percentage identity may b measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
5 sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
10 explained in more detail above, generally preserve the charge andjiydrophobicity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
15 polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=:l, gap 
penalty^, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the '^percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
20 comparison of two polypeptide sequences, one may use the te BLAST 2 Sequences" tool Version 
2.0. 12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
25 Gap x drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
30 for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, maybe 
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used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 
5 The term "humanized antibody* refers to an antibody molecule in which the amino acid 

sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 

10 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the '^washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, Le., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 

IS conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in 
the art and maybe consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 /ig/ml sheared, denatured salmon sperm DNA. 

20 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

under which the wash step is carried out Such wash temperatures are typically selected to be about . 
5°C to 20°C lower than the thermal melting point (T^) for the specific sequence at a defined ionic 
strength and pH. The T ra is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 

25 conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et.aL (1989) 
Molecular H nninp ;: A Laboratory Manual 2 nd ed., voL 1-3, Cold Spring Harbor Press, Plainview NY; 
specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 

30 Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 /xg/mL Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
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such as for RNADNA hybridizations. Useftil variations on these wash conditions win be readily 
apparent to those of ordinary skill in the art Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongjy indicative of a similar role for the nucleotides and their encoded polypeptides. 
5 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., Cot or R^ analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
10 to which cells or their nucleic acids have been fixed). 

Hie words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

'Immune response'* can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
15 of various factors, e.g., cytokines, chemokincs, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
20 TRICH which is useful in any of the antibody production methods disclosed herein or known in the art 
The term "microarray^ refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element* * refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a micro array. 
25 The term "modulate" refers to a change in the activity of TRICH. For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of TRICH 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
30 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
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sequence. OperaMy linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

'Teptide nucleic acid" (PNA) refers to an antisens molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
5 amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

*Tost-translational modification' 9 of an TRICH may involve Hpidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 

10 art. These processes may occur synthetically or biochemically. Biochemical modifications will vary 
by cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 

15 Typical labels include radioactive isotopes, ligands, chemDhiminescent agents, and enzymes. "Primers" 
are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

20 Probes and primers as used in the present invention typically comprise at least 15 contiguous 

nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 

25 specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et aL (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, RM. et al (1987) Current Protocols in Molecular 
Biology , Greene PubL Assoc. & Wiley-Intersciences, New Yoik NY; Innis, M. et aL (1990) PCR 

30 Protocols, A Guide to Methods and Applications. Academic Press, San Diego CA. PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
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purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
5 primer selection program (available to the public from the Genome Center at University of Texas 
South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Prhner3 primer 
selection program (available to the public from the Whitehead Institute/MIT Center for Genome 
Research, Cambridge MA) allows the user to input a ''mispriming library," in which sequences to 

10 avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 
oligonucleotides for micro arrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The 
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource 
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 

15 selection of primers that hybridize to either the most conserved or least conserved regions of aligned 
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 
identified by any of the above selection methods are useful in hybridization technologies, for example, 
as PCR or sequencing primers, micro array elements, or specific probes to identify fully or partially 

20 complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are 
not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 

25 artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

30 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammaL 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
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(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
5 chennluminescent, or bhromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
10 instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, 
in solution or bound to a substrate; a tissue; a tissue print; etc. 
15 The terms "specific binding" and "specifically binding" refer to that interaction between a 

protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g. , the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the 
20 epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the 
antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with 
25 which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
30 microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to die collective pattern of gene expression by a particular cell 
type or tissu under given conditions at a given time. 

'Transformation" describes a process by which exogenous DNA is introduced into a recipient 
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eel Transfonnation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or viral 

5 infection, electroporation, heat shock, lipofection, and particle bombardment The term "transformed 
cells" includes stably transformed cells in which the inserted DNA is capable of replication either as 
an autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 

10 and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 

introduced by way of human intervention, such as by transgenic techniques well known in the art The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. Hie term genetic manipulation does not include classical cross-breeding, or in vitro 

15 fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The transgenic 
organisms contemplated in accordance with the present invention include bacteria, cyanobacteria, 
fungi, plants and animals. The isolated DNA of die present invention can be introduced into the host 
by methods known in the art, for example infection, transfection, transformation or transconjugation. 
Techniques for transferring the DNA of the present invention into such organisms are widely known 

20 and provided in references such as Sambrook et al (1989), supra . 

A 'Variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 

25 least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 

30 polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides will generally have significant amino acid identity relative to each 
other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
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between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. Hie 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 
5 A 'Variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 

at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 
10 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

THE INVENTION 

The invention is based on the discovery of new human transporters and ion channels 
15 (TRICH), the polynucleotides encoding TRIGS, and the use of these compositions for the diagnosis, 
treatment, or prevention of transport, neurological, muscle, immunological, and cell proliferative 
disorders. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a 

20 single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is denoted 
by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte 
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide sequence is 
denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID NO:) and an 
Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 

25 Table 2 shows sequences with homology to the polypeptides of the invention as identified by 

BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. Column 3 
shows the GenBank identification number (Genbank ID NO:) of the nearest GenBank homolog. 

30 Column 4 shows the probability score for the match between each polypeptide and its GenBank 
homolog. Column 5 shows the annotation of the GenBank homolog along with relevant citations 
where applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 and 
2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
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p lypeptide sequence number (Incyte Polypeptide ID) for eadi polypeptide of the invention. Column 
3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS 
program of the GCG sequence analysis software package (Genetics Computer Group, Madison WI). 
5 Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Column 7 
shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are transporters and ion channels. For example, 

10 SEQ ID NO:2 is 94% identical from amino acids 965 through 2436 to mouse abc2 transporter 
(GenBank ID g495259) as determined by the Basic Local Alignment Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is 0.0, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO:2 also contains two ABC 
transporter domains as determined by searching for statistically significant matches in the hidden 

15 Markov model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) 
Data from MOTIFS, and FROFILESCAN analyses provide further corroborative evidence that SEQ 
ID NO:3 is an ABC transporter. In an alternate example, SEQ ID NO:13 is 97% identical to human 
gamma subunit precursor of muscle acetylcholine receptor (GenBank ID g825618) as determined by 
the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 

20 3.0e-273, which indicates the probability of obtaining the observed polypeptide sequence alignment by 
chance. SEQ ID NO: 13 also contains a neurotransmitter-gated ion-channel domain as determined by 
searching for statistically significant matches in the hidden Markov model (HMM)-based PFAM 
database of conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and 
PROFILES CAN analyses provide further corroborative evidence that SEQ ID NO:13 is a 

25 neurotransmitter-gated ion-channel protein. In an alternate example, SEQ ID NO:19 is 62% identical 
to human vacuolar proton-ATPase (GenBank ID g37643) as determined by the Basic Local ■ 
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 3.2e-129, which 
indicates the probability of obtaining the observed polypeptide sequence alignment by chance. Data 
from BLAST analyses provide further corroborative evidence that SEQ ID NO: 19 is a vacuolar ATP 

30 synthase. In an alternate exampple, SEQ ID NO:22 is 94% identical to rat GABA(A) receptor 
gamma- 1 subunit (GenBank ID §56176) as determined by the Basic Local Alignment Search Tool 
(BLAST). (See Table 2.) The BLAST probability score is 4.4e-244, which indicates the probability 
f obtaining the observed polypeptide sequence alignment by chance. SEQ ID N022 also contains a 
neurotransmitter-gated ion channel domain as determined by searching for statistically significant 
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matches in the hidden Markov model (HMM)-based PFAM databas of conserved protein family 
domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide 
further corroborative evidence that SEQ ID NO:22 is a neurotransmitter-gated ion channel In an 
alternate example, SEQ ID NO:26 is 61% identical to rabbit peroxisomal Ca-dependent solute carrier 
5 (GenBank ID £2352427) as determined by the Basic Local Alignment Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is 6.4e-156, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO:26 also contains three 
mitochondrial carrier protein domains, as well as three EF hand domains, as determined by searching 
for statistically significant matches in the hidden Markov model (HMM>based PFAM database of 

10 conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and 

PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:26 is a calcium 
dependent carrier protein. In an alternate examplpe, SEQ ID NO:17 is 69% identical to Ambystoma 
tigrinum electrogenic NaHC0 3 cotransporter (GenBank ID g2 198815) as determined by the Basic 
Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, which 

15 indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ 
ID NO:17 also contains an HC0 3 transporter family domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and 
PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:17 is an anion 

20 transporter. SEQ ID NO:l, SEQ ID NO:3-12, SEQ ID NO:14-16, SEQ ID NO:18, and SEQ ID 
NO:20-25 were analyzed and annotated in a similar manner. The algorithms and parameters for the 
analysis of SEQ ID NO:l-26 are described in Table 7. 

As shown hi Table 4, the full length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 

25 combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention. 
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments of 
the polynucleotide sequences which are useful, for example, in hybridization or amplification 

30 technologies that identify SEQ ID NO:27-52 or that distinguish between SEQ ID NO:27-52 and 
related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages 
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5') 
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and stop (3') positions of the cDNA and/or genomic sequences in column 5 relative to their respective 
full length sequences. 

The identification numbers in Column 5 of Table 4 may refer specifically, for example, to 
Incyte cDNAs along with their corresponding cDNA libraries. For example, 7251266F7 is the 

5 identification number of an Incyte cDNA sequence, and PROSTMY01 is the cDNA library from 
which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from 
pooled cDNA libraries (e.g., 70564238V1). Alternatively, the identification numbers in column 5 may 
refer to GenBank cDNAs or ESTs (e.g., g4689801) which contributed to the assembly of the full 
length polynucleotide sequences. In addition, the identification numbers in column 5 may identify 

10 sequences derived from the ENSEMBL (The Sanger Centre, Cambridge, UK) database (ie. 7 those 
sequences including the designation "ENST"). Alternatively, the identification numbers in column 5 
may be derived from the NCBI RefSeq Nucleotide Sequence Records Database (L e. , those 
sequences including the designation "NM" or "NT') or the NCBI RefSeq Protein Sequence Records 
(Le. , those sequences including the designation "NP"). Alternatively, the identification numbers in 

15 column 5 may refer to assemblages of both cDNA and Genscan-predicted exons brought together by , 
an "exon stitching" algorithm. For example, FLJOOOOOCJf 1 Jf 2 jnrYYY_N 3 JNf 4 represents a 
"stitched" sequence in which XXXXXX is the identification number of the cluster of sequences to 
which the algorithm was applied, and YYYYYis the number of the prediction generated by the 
algorithm, and N Jf2r3 _, if present, represent specific exons that may have been manually edited during 

20 analysis (See Example V). Alternatively, the identification numbers in column 5 may refer to 
assemblages of exons brought together by an "exon-stretching" algorithm. For example, 
FLXXXXXX^gAAAAA __gBBBBB_l_N is the identification number of a "stretched" sequence, with 
XXXXXX being the Incyte project identification number, gAAAAA being the GenBank identification 
number of the human genomic sequence to which the "exon-stretehingT algorithm was applied, 

25 gPBBBB being the GenBank identification number or NCBI RefSeq identification number of the 
nearest GenBank protein homolog, and N referring to specific exons (See Example V). In instances 
where a RefSeq sequence was used as a protein homolog for the "exon-stretching" algorithm, a 
RefSeq identifier (denoted by "NM " "NP," or "NT") may be used in place of the GenBank identifier 
(Le.,gPBBBB). 

30 Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 

genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following Table lists examples of component sequence prefixes and corresponding sequence analysis 
methods associated with the prefixes (see Example IV and Example V). 
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Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK) 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
column 5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte 

10 cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide 
sequences which were assembled using Incyte cDNA sequences. The representative cDNA library 
is the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences 
which were used to assemble and confirm the above polynucleotide sequences. The tissues and 

15 vectors which were used to construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses TRICH variants. A preferred TRICK variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TRICH amino acid sequence, and which contains at least one functional or 
structural characteristic of TRICH. 

20 The invention also encompasses polynucleotides which encode TRICH. In a particular 

embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:27-52, which encodes TRICH. The polynucleotide 
sequences of SEQ ID NO:27-52, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 

25 sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. In 
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TRIOBL A particular aspect of the invention encompasses a variant of a 

30 polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID N027- 
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52 which has at least about 70%, or alternatively at least about 85%, or even at least about 95% 
polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of SEQ 
ID NO:27-52. Any one of the polynucleotide variants described above can encode an amino acid 
sequence which contains at least one functional or structural characteristic of TRICR 
5 It will be appreciated by those skilled in the art that as a result of the degeneracy of the 

genetic code, a multitude of polynucleotide sequences encoding TRICE, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 

10 combinations are made in accordance with the standard triplet genetic code as applied to the 

polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered as 
being specifically disclosed. 

Although nucleotide sequences which encode TRICH and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately selected 

15 conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TRICH or 
its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
occurring codons. Codons may be selected to increase die rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host Other reasons for substantially altering the nucleotide 

20 sequence encoding TRICH and its derivatives without altering the encoded amino acid sequences 

include the production of RNA transcripts having more desirable properties, such as a greater half-life, 
than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TRICH and 
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 

25 synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding TRICH or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 

30 NO:27-52 and fragments thereof under various conditions of stringency, (See, e.g., WahL, G.M. and 
S.L. Berger (1987) Methods EnzymoL 152:399-407; Kimmel, A.R (1987) Methods EnzymoL 152:507- 
511.) Hybridization conditions, including annealing and wash conditions, are described in 'Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
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of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferahly, sequence preparation is 

5 automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), 
PTC200 fliennal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler 
(Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA 
sequencing system (Applied Biosystems), the MEGABACE 1000 DNA sequencing system 
(Molecular Dynamics, Sunnyvale CA), or other systems known in the art The resulting sequences 

10 are analyzed using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, P.M. 
(1997) Short Protocols in Molecular Biology . John Wiley & Sons, New York NY, unit 7.7 ; Meyers, 
RJl. (1995) Molecular Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 

15 such as promoters and regulatory elements. For example, one method which may he employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PGR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 

20 a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et aL (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et 
aL (1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may he used to insert an engineered double-stranded sequence into a region of unknown 

25 sequence before performing PCR Other methods which may be used to retrieve unknown sequences 
are known in the art (See, e.g., Parker, J.D. et al (1991) Nucleic Acids Res. 19:3055-3060). 
Additionally, one may use PCR, nested primers, and PROMOTERHNDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 

30 commercially available software, such as OLIGO 4.06 primer analysis software (National 

Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, 
to have a GC content of about 50% or more, and to anneal to the template at temperatures of about 
68°Cto72°C 
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When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which ften include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a fiiD-length cDNA. Genomic libraries may be useful for extension of sequence 
5 into 5* non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available maybe used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 

10 emitted wavelengths. Output/U^t intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

15 In another embodiment of the invention, polynucleotide sequences or fragments thereof which 

encode TRICH may be cloned in recombinant DNA molecules that direct expression of TRICH, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 
of the genetic code, other DNA sequences which encode substantially the same or a functionally 
equivalent amino acid sequence may be produced and used to express TRICH. 

20 The nucleotide sequences of the present invention can be engineered using methods generally 

known in the art in order to alter TRICH-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 

25 mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5,837,458; Chang, C-C et aL (1999) Nat BiotechnoL 17:793-797; Christians, RC et aL (1999) Nat 

30 BiotechnoL 17:259-264; and Crameri, A. et aL (1996) Nat BiotechnoL 14:315-319) to alter or improve 
the biological properties of TRICH, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
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preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutati ns maybe 
recombined, screened, and then reshuffled until (he desired properties are optimized. Alternatively, 
5 fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable maimer. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in part, 
using chemical methods well known in the art (See, e.g., Caruthers, M JL et aL (1980) Nucleic Acids 

10 Symp. Ser. 7:215-223 ; and Horn, T. et aL (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
TRICH itself or a fragment thereof may be synthesized using chemical methods. For example, 
peptide synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. (1984) Proteins, Structures and Molecular Properties , WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et aL (1995) Science 269:202-204.) Automated synthesis maybe achieved 

15 using the ABI 43 1 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence 
of TRICH, or any part thereof, may be altered during direct synthesis and/or combined with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide 
having a sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 

20 chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods EnzymoL 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH or 
derivatives thereof maybe inserted into an appropriate expression vector, i.e., a vector which contains 

25 the necessary elements for transcriptional and translational control of the inserted coding sequence in 
a suitable host These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3* untranslated regions in the vector and in polynucleotide sequences 
encoding TRICH Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding TRICH. Such signals 

30 include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 
sequences encoding TRICH and its initiation codon and upstream regulatory sequences are inserted 
into the appropriate expression vector, no additional transcriptional or translational control signals may 
be needed. How ver, in cases where only coding sequence, or a fragment thereof, is inserted, 
exogenous translational control signals including an in-frame ATG initiation codon should be provided 
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by the vector. Exogenous translational elements and initiation codons may be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers 
appropriate for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. 
Cell Differ. 20:125-162.) 

5 Methods which are well known to those skilled in the art may be used to construct expression 

vectors containing sequences encoding TRICH and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory 
Manual Cold Spring Harbor Press, Plainview NY, ck 4, 8, and 16-17; Ausubel, F.M. et aL (1995) 

10 Current Protocols in Molecular Biology. John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding TRICH. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculo virus); 

15 plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra; Ausubel, supra ; Van Heeke, G. and S.M. Schuster 
(1989) J. BioL Chem. 264:5503-5509; Engelhard, E.EL et aL (1994) Proc. NatL Acad. ScL USA 
91:3224-3227; Sandig, V. et aL (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, N. (1987) EMBO 

20 J. 6:307-311; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New 

York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. NatL Acad. Sci. USA 81:3655-3659; and 
Harrington, J J. et aL (1997) Nat Genet 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids , may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, 

25 M. et aL (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. NatL Acad. ScL USA 

90(13):6340-6344; BuDer, R.M. et aL (1985) Nature 317(6040):813-815; McGregor, D.P. et aL (1994) 
MoL ImmunoL 31(3)219-226; and Verma, LM. and N. Somia (1997) Nature 389:239-242.) The 
invention is not limited by the host cell employed. 

la bacterial systems, a number of cloning and expression vectors may be selected depending . 

30 upon the use intended for polynucleotide sequences encoding TRICH. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved using a 
multifunctional B. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (life Technologies). ligation of sequences encoding TRICH into the vector's multiple 
cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
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transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. BioL 
Clem. 2645503-5509.) When large quantities of TRICH are needed, e.g. for the production of 
5 antibodies, vectors which direct high level expression of TRICH may be used. For example, vectors 
containing the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of TRICH. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Hchia pastoris . la addition, such 

10 vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra ; 
Bitter, G.A. et aL (1987) Methods EnzymoL 153:516-544; and Scorer, C.A. et aL (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of TRICH. Transcription of sequences 

15 encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO L 
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters maybe used. (See, e.g r , Coruzzi, G. et aL (1984) EMBO J. 3:1671-1680; Broglie, R. et aL 
(1984) Science 224:838-843; and Winter, L et aL (1991) Results Probl. Cell Differ. 17:85-105.) These 

20 constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 

transfectioiL (See, e.g.. The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill 
New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated into 

25 an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or £3 region of the viral genome may be used to obtain 
infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 

30 based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
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or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed into 
cell lines using expression vectors which may contain viral origins of replication and/or endogenous 
5 expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 

10 culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
pbosphoribosyltransferase genes, for use in tk and apr cells, respectively. (See, e.g., Wigler, M. et 
aL (1977) Cell 11523-232; Lowy, L etaL (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or 

15 herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et aL (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. MoL Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 

20 alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
NatL Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anfhocyanins, green fluorescent proteins 
(GFP; Ctontech), B glucuronidase and its substrate B-ghicuronide, or luciferase and its substrate 
hiciferin may be used. These markers can be used not only to identify transformants, but also to 
. quantify the amount of transient or stable protein expression attributable to a specific vector system. 

25 (See, e.g., Rhodes, C.A. (1995) Methods Mol BioL 55:121-13 1.) 

Although the presence/absence of marker gene expression suggests that the gene of interest 
is also present, the presence and expression of the gene may need to be confirmed. For example, if 
the sequence encoding TRICH is inserted within a marker gene sequence, transformed cells 
containing sequences encoding TRICH can be identified by the absence of marker gene function. 

30 Alternatively, a marker gene can be placed in tandem with a sequence encoding TRICH under the 
control of a single promoter. Expression of the marker gene in response to induction or selection 
usually indicates expression of the tandem gene as well 

In general, host cells that contain the nucleic acid sequence encoding TRICH and that express 
TRICH may be identified by a variety of procedures known to those of skill in the art These 
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procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 
Immunological methods for detecting and measuring the expression of TRICH using either 
5 specific polyclonal or monoclonal antibodies are known in the art Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a 
competitive binding assay maybe employed. These and other assays are well known in the art (See, 
10 e.g., Hampton, R. et aL (1990) Serological Methods, a Laboratory Manual, APS Press, St Paul MN, 
Sect IV; Coligan, J.E. et at (1997) Current Protocols in Immunology, Greene Pub. Associates and 
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana 
Press, TotowaNJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 

15 may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding TRICH include 
oligolabeHng, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 

20 and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chrpmogenic 

25 agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intraceDularly depending on the sequence 
and/or the vector used. As win be understood by those of skill in the art, expression vectors containing 

30 polynucleotides which encode TRICH may be designed to contain signal sequences which direct 
secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, gjycosylation, phosphorylation 
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lipidation, and acylation. Post-translational processing which cleaves a '^prepro" or **pro" form of the 
protein may also be used to specify protein targeting* folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WD 8) are available from the American Type Culture 
5 Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and 
processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein 

10 containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screeningof peptide libraries for inhibitors of TRICH activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-BBs, FLAG, c-myc> and 

15 hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-BBs enable purification of their cognate fusion 

proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 

20 located between the TRICH encoding sequence and the heterologous protein sequence, so that 
TRICH may be cleaved away from the heterologous moiety following purification. Methods for 
fusion protein expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of 
commercially available lrits may also be used to facilitate expression and purification of fusion proteins. 
In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved in 

25 vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 

systems couple transcription and translation of protein-coding sequences operably associated with the 
17, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, ^S-methionine. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 

30 that specifically bind to TRICH At least one and up to a plurality of test compounds maybe screened 
for specific binding to TRICH Examples of test compounds include antibodies, oligonucleotides, 
proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
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natural binding partner. (See, e.g., Coligan, J.E. et aL (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TRICH 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
5 these compounds involves producing appropriate cells which express TRICH, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila , or E. 
coli . Cells expressing TRICH or cell membrane fractions which contain TRICH are then contacted 
with a test compound and binding, stimulation, or inhibition of activity of either TRICH or the 
. compound is analyzed. 

10 An assay may simply test binding of a test compound to the polypeptide, wherein binding is 

detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label For example, the 
assay may comprise the steps of combining at least one test compound with TRICH, either in solution 
or affixed to a solid support, and detecting the binding of TRICH to the compound. Alternatively, the 
assay may detect or measure binding of a test compound in the presence of a labeled competitor. 

15 Additionally, the assay may be carried out using cell-free preparations, chemical libraries, or natural 
product mixtures, and the test compound(s) maybe free in solution or affixed to a solid support 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that modulate the activity of TRICH Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for TRICH 

20 activity, wherein TRICH is combined with at least one test compound, and the activity of TRICH in 
the presence of a test compound is compared with the activity of TRICH in the absence of the test 
compound. A change in the activity of TRICH in the presence of the test compound is indicative of a 
compound that modulates the activity of TRICH Alternatively, a test compound is combined with an 
in vitro or cell-free system comprising TRICH under conditions suitable for TRICH activity, and the 

25 assay is performed. In either of these assays, a test compound which modulates the activity of 
TRICH may do so indirectly and need not come in direct contact with the test compound. At least 
one and up to a plurality of test compounds maybe screened. 

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs may 
be <( knocked outf* in an animal model system using homologous recombination in embryonic stem (ES) 

30 cells. Such techniques are well known in the art and are useful for the generation of animal models of 
human disease. (See, e.g., U.S. Patent No. 5,175383 and U.S. Patent No. 5,767337.) For example, 
mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and 
grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted 
by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 
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244:1288-1292). The vector integrates into the corresponding regi n of the host genome by 
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, JJD. 
(1996) din. Invest. 97:1999-2002; Wagner, K.U. et aL (1997) Nucleic Acids Res. 25:4323-4330). 
5 Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopreguant dams, and 
the resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous 
strains. Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 
Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from 

10 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1145-1147). 

Polynucleotides encoding TRICH can also be used to create tc knockin" humanized animals 

15 (pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, 

20 a mammal inbred to overexpress TRICH, e.g. , by secreting TRICH in its milk, may also serve as a 
convenient source of that protein (Janne, J. et aL (1998) BiotechnoL Annu: Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of TRICH and transporters and ion channels. In addition, the expression of TRICH is closely 

25 associated with brain, lung, prostate, bladder, bone, hypothalamus, breast, ileum, stomach, pancreas, 
and gastrointestinal tissues and tumors of the brain and prostrate. Therefore, TRICH appears to play 
a role in transport, neurological, muscle, immunological, and cell proliferative disorders, hi the 
treatment of disorders associated with increased TRICH expression or activity, it is desirable to 
decrease the expression or activity of TRICH. In the treatment of disorders associated with 

30 decreased TRICH expression or activity, it is desirable to increase the expression or activity of 
TRICH 

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
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such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular 
dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes meDitus, diabetes insipidus, diabetic 
neuropathy, Duchenne muscular dystrophy, hyperkalemia periodic paralysis, normokalemic periodic 
paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis, 

5 myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral 

neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 
tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline myopathy, 
centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, ethanol 
myopathy, dennatomyositis, inclusion body myositis, infectious myositis, polymyositis, neurological 

10 disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, dementia, 
depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other disorders 
associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, 
sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery stenosis, 
sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's 

15 disease, Addison's disease, ghicose-galactose malabsorption syndrome, hypercholesterolemia, 
adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, cystinuria, iminoglyciuiiria, Hartup disease, and Fanconi disease; a neurological disorder such 
as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 

20 amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 
retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases including 
kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial 

25 insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous 

sclerosis, cerebefloretinal heroangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorders of the central nervous system including Down syndrome, cerebral 
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 

30 dennatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 
seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degeneration, and familial fiontotemporal dementia; a muscle disorder 
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such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular dystr phy, 
myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, 
mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis, inclusion body myositis, 
thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, 
5 cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, myocardial infarction, 
migraine, pheochromocytoma, and myopathies including encephalopathy, epilepsy, Kearns-Sayre 
syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, and acid maltase deficiency (AMD, 
also known as Pompe's disease); an immunological disorder such as acquired immuno deficiency 
syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing 

10 spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, 
atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 

15 thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 

myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, 
Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, 
systemic lupus erythematosus; systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 

20 bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia 
vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia, 
lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal 

25 gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, 
heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, 
spleen, testis, thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

30 expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of TRICH including, but not limited to, 
those provided above. 
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In still another embodiment, an agonist which modulates the activity of TRICH may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat or 
5 prevent a disorder associated with increased expression or activity of TRICH. Examples of such 

disorders include, but are not limited to, those transport, neurological, muscle, immunological, and cell 
proliferative disorders described above. In one aspect, an antibody which specifically binds TRICH 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissues which express TRICH. 

10 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TRICH including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 

15 therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 
combination of therapeutic agents may act synergisticaOy to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

20 An antagonist of TRICH may be produced using methods which are generally known in the 

art In particular, purified TRICH may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may 
also be generated using methods that are well known in the art Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 

25 fragments produced by a Fab expression library. Neutralizing antibodies (i.e. , those which inhibit 
dimer formation) are generally preferred for therapeutic use. . 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and 
others may be immunized by injection with TRICH or with any fragment or oligopeptide thereof which 
has immunogenic properties. Depending on the host species, various adjuvants may be used to 

30 increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenoL Among adjuvants used in humans, BCG 
(bacilli Cahnette-Guerin) and Corvnebacterium parvum are especially preferable. 
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It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches 
5 of TRICH amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to TRICH may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-ceD hybridoma technique, and the EBV-hybridoma 
10 technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et aL (1985) J. 

ImmunoL Methods 81:31-42; Cote, RJ. et al. (1983) Proa NatL Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et aL (1984) Mol Cell BioL 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
15 antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et aL (1984) Proc. 
NatL Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et aL (1984) Nature 312:604-608; and Takeda, 
S. et aL (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce TRICH-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
20 generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, 
D.R. (1991) Proc. NatL Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g., Orlandi, R. et aL (1989) Proc. NatL Acad. ScL USA 86:3833-3837; Winter, 
25 G. et aL (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TRICH may also be generated. 
For example, such fragments include, but are not limited to, F(ab l ) 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
30 easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et aL (1989) Science 246:1275-1281.) 

Various immunoassays maybe used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art Such 
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immunoassays typically involve the measurement of complex formation between TRICH and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering TRICH epitopes is generally used, but a competitive binding assay may also be 
employed (Pound, supra) . 

5 Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 

maybe used to assess the affinity of antibodies for TRICH. Affinity is expressed as an association 
constant, K^, which is defined as the molar concentration of TRICH-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The K, 
determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for 

10 multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies for TRICH. 

The Ka determined for a preparation of monoclonal antibodies, which are monospecific for a particular 
TRICH epitope, represents a true measure of affinity. High-affinity antibody preparations with K, 
ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays in which the TRICH- 
antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with K, 

15 ranging from about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar 
procedures which ultimately require dissociation of TRICH, preferably in active form, from the 
antibody (Catty, D. (1988) Antibodies. Volume I: A Practical Approach. IRL Press, Washington DC; 
Iiddell, J.E. and A. Cryer (1991) A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, 
New York NY). 

20 The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 

the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of TRICH-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 

25 antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and 
Coligan et al supra .) 

In another embodiment of the invention, the polynucleotides encoding TRICH, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, modifications 
of gene expression can be achieved by designing complementary sequences or antisense molecules 
30 (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene 
encoding TRICH Such technology is well known in the art, and antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding TRICH (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics. Humana Press Inc., 
TotawaNI.) 
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In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 

5 Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, KJ. et al. (1995) 
9(13): 1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; AusubeL supra ; Uckert, W. and W. Walther (1994) Pharmacol Iher. 63(3):323-347.) Ofaer 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 

10 systems known in the art (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
aL (1998) J. Pharm Sci. 87(11):1308-1315; and Monis, M.C et aL (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

Jsx another embodiment of the invention, polynucleotides encoding TRICH maybe used for 
somatic or gennline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 

15 (e.g., in the cases of severe combined immunodeficiency (SCED)-X1 disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et aL (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et aL (1995) Science 270:475-480; Bordignon, C. et aL (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et aL (1993) Cell 75:207-216; Crystal, R.G. et aL (1995) Hum. Gene 

20 Therapy 6:643-666; Crystal, R.G. et aL (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor DC deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, LM. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 

25 against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; PoescUa, E. et al. (1996) Proc. NatL Acad. Sci. USA. 93:11395-11399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis : and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzT) . In the 
case where a genetic deficiency in TRICH expression or regulation causes disease, the expression of 

30 TRICH from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies for 
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use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and 
(v) the use of DNA transposons (Morgan, R.A. and W.R Anderson (1993) Annu. Rev. Biochem. 
62:19-1-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. Opin. 

5 BiotechnoL 9:445-450). 

Expression vectors that may be effective for the expression of TRICH include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSHZPERV (Stratagene, La JoDa CA), 
and PTET-OFF, FTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TRICH 

10 may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous 
sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter 
(e.g., the tetxacycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. NatL Acad. Sci. 
USA 895547-5551; Gossen, M. et aL (1995) Science 268:1766-1769; Rossi, F.M.V. and EM. Blau 
(1998) Curr. Opin. BiotechnoL 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); 

15 the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and Blau, ELM. supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding TRICH from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 

20 TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A. J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et aL 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these 

25 standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 

30 element (RRE) along with additional retrovirus m-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PEBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
NatL Acad. Scl USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
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receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et aL 
(1987) J. Virol 61:1647-1650; Bender, MA. et aL (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et aL (1998) J. Virol. 72:8463-8471; Zufferey, R el 
aL (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining 
5 retrovirus packaging cell lines producing Mgfc transducing effidency retrod 

a method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., Ct>4 + T-cells), and the 
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et aL (1997) J. ViroL 71:7020-7029; Bauer, G. et 

10 aL (1997) Blood 89:2259-2267; Bonyhadi, MX. (1997) J. ViroL 71:4707-4716; Ranga, U. et al. (1998) 
Proc. NatL Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with respect 
to the expression of TRICH. The construction and packaging of adenovirus-based vectors are well 

15 known to those with ordinary skill in the art Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy")* hereby 
incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et aL (1999) Annu. Rev. 

20 Nutr. 19511-544 and Venna, LM. and N. Somia (1997) Nature 18:389:239-242, both incorporated by 
reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with 
respect to the expression of TRICK The use of herpes simplex virus (HSV)-based vectors may be 

25 especially valuable for introducing TRICH to cells of the central nervous system, for which HS V has 
a tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X et aL (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 

30 Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 
which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by 
this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and 
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ICP22. For HSV vectors, see also Goins, W.R et al (1999) J. Virol. 73:519-532 and Xu, H. et al. 
(1994) Dev. BioL 163:152-161, hereby incorporated by reference. The manipulation of cloned 
herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
5 of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 
ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 

10 on the SFV genome (Garoff, EL and K-J. Li (1998) Curr. Opin. BiotechnoL 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into the 

15 alphavirus genome in place of the capsid-coding region results in the production of a large number of 
TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced cells. While 
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 

20 application (Dryga, S.A. et aL (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of TRICH into a variety of cell types. The specific transduction of a subset of 
cells in a population may require the sorting of cells prior to transduction. The methods of 
manipulating infectious cDNA clones of alphaviruses, pejcforming alphavirus cDNA and RNA 
transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 

25 art 

Oligonucleotides derived from die transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
30 transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, IE. et al. (1994) in Huber, B.R and B.L Carr, 
Molecular and Immunologic Approaches . Futura Publishing, Mt Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translati n f mRNA 
by preventing the transcript from binding to rib'osomes. 
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Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 

molecule to complementary target RNA, followed by endonucleolytic cleavag . For example, 

engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 

5 endonucleolytic cleavage of sequences encoding TRICK 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 

GUU, and GUC Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 

corresponding to the region of the target gene containing the cleavage site, maybe evaluated for 

10 secondary structural features which may render the oligonucleotide inoperable. The suitability of 

candidate targets may also be evaluated by testing accessibility to hybridization with complementary 

oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 

by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 

IS for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 

Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 

sequences encoding TRICK. Such DNA sequences may be incorporated into a wide variety of 

vectors with suitable RNA polymerase promoters such as 17 or SP6. Alternatively, these cDNA 

constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into cell 

* 

20 lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3 ' ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 

25 extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICK Compounds 

30 which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
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polynucleotide expression. Thus, in the treatment of disorders associated with increased TRICH 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding TRICH may be therapeutically useful, and in the treatment of disorders associated with 
decreased TRICH expression or activity, a compound which specifically promotes expression of the 
5 polynucleotide encoding TRICH may be therapeutically useful. 

At least one, and up to a plurality, of test compounds maybe screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 

10 library of naturaUyK>ccurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatoriafly or randomly. A sample comprising a 
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or penneabilized cell, or an in vitro cell-free or reconstituted 

15 biochemical system. Alterations in the expression of a polynucleotide encoding TRICH are assayed 
by any method commonly known in the art Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus forming 
the basis for a comparison of the expression of the polynucleotide both with and without exposure to 

20 one or more test compounds. Detection of a change in the expression of a polynucleotide exposed to 
a test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomvces pombe gene expression system (Atkins, 
D. et aL (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et aL (2000) Nucleic Acids Res. 28:E15) or a 

25 human cell line such as HeLa cell (Clarke, M.L. et aL (2000) Biochemu Biophys. Res. Commun. 

268:8-13). A particular embodiment of the present invention involves screening a combinatorial library 
of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et a! 
(1997) U.S. Patent No. 5,686,242; Bruice, T.W, et aL (2000) U.S. Patent No. 6,022,691). 

30 Many methods for introducing vectors into cells or tissues are available and equally suitable 

for use in vivo, in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
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using methods which are well known in the art (See, e.g., Goldman, C.K. et aT (1997) Nat 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
5 monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 
generally comprises an active ingredient formulated with a pharmaceuticaDy acceptable excipient 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 

10 Pharmaceutical Sciences (Maack Publishing. Baston PA). Such compositions may consist of TRICH, 
antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of TRICH. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 

15 sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast- 
acting formulations is well-known in the art In the case of macromolecules (e.g. larger peptides and 

20 proteins), recent developments in the field of pulmonary delivery via the alveolar region of the lung 
have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. 
et aL, U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without 
needle injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 

25 ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of those skilled in the art 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 

30 macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic N- 
terminal portion from the HTV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al (1999) Science 285:1569-1572). 
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For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to detennine useful doses and routes for 
5 administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of TRICH, 
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity maybe determined 
by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 

10 calculating the ED^ (the dose therapeutically effective in 50% of the population) or LD^ (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LDx/ED 50 ratio. Compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 

15 preferably within a range of circulating concentrations that includes the ED 50 with Htfle or no toxicity. 
The dosage varies within this range depending upon Ihe dosage form employed, the sensitivity of the 
patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment Dosage and administration are adjusted to provide sufficient levels of the 

20 active moiety or to main tain the desired effect Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination^), reaction sensitivities, and response 
to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

25 Normal dosage amounts may vary from about 0.1 fig to 100,000 ^g, up to a total dose of 

about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 

30 conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TRICH may be used for the 
diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for diagnostic 
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purposes may b prepared in the skme manner as described above for therapeutics. Diagnostic 
assays for TRICH include methods which utilize the antibody and a label to detect TRICH in human 
body fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, 
and may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
5 reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. Normal 
or standard values for TRICH expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to TRICH under 

10 conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TRICH expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TRICH may be used for 

15 diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of TRICH may be correlated 
• with disease. The diagnostic assay maybe used to determine absence, presence, and excess 
expression of TRICH, and to monitor regulation of TRICH levels during therapeutic intervention. 

20 In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 

sequences, including genomic sequences, encoding TRICH or closely related molecules may be used 
to identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5 'regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 

25 probe identifies only naturally occurring sequences encoding TRICH, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:27-52 or from 
30 genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DNAs encoding TRICH include the 
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
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polymerases and the appropriate labeled nucleotides. Hybridization probes maybe labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders 

5 associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy. Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 
normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 

10 myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, 
cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, 
bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, 
nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic 
myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, infectious myositis, 

15 polymyositis, neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, 
bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and 
schizophrenia, and other disorders associated with transport, e.g., neurofibromatosis, postherpetic 
neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, 
pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's 

20 disease, goiter, Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn 
syndrome, von Gierke disease, cystinuria, iroino glycinuria, Hartup disease, and Fanconi disease; a 
neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 

25 extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker 

30 syndrome, fatal familial insomnia, nutritional and metabolic diseases of the nervous system, 

neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal 
syndrome, mental retardation and other developmental disorders of the central nervous system 
including Down syndrome, cerebral palsy, neuroskeletal dis rders, autonomic nervous system 
disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular 



62 



WO 02/22684 



PCT7US01/28938 



disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
5 neuralgia, Tourette- s disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis, Duchenne's 
muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy, central core disease, nemaSne 
myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, infectious myositis, 
polymyositis, dermatomyositis, inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, 

10 anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, 
hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and myopathies including 
encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, 
ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); an 
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, 

15 adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes meHitus, emphysema, episodic 
lymphopenia with lymphocyte toxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 

20 glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 

25 syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 

fungal, parasitic, protozoal, and helminthic infections, and trauma; and a cell proliferative disorder such 
as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective 
tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, 
psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, 

30 melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, 
bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, 
kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
testis, thymus, thyroid, and uterus. The polynucleotide sequences encoding TRICH may b used in 
Southern or northern analysis, dot blot, or other membrane-based technologies; in PGR technologies; in 
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dipstick, pin, and multiformat ELISA-like assays; and in microairays utilizing fluids or tissues from 
patients to detect altered TRICH expression. Such qualitative or quantitative methods are well known 
in the art 

In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays that 
5 detect the presence of associated disorders, particularly those mentioned above. The nucleotide 

sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard 
value. If the amount of signal in the patient sample is significantly altered in comparison to a control 

10 sample then the presence of altered levels of nucleotide sequences encoding TRICH in the sample 
indicates the presence of the associated disorder. Such assays may also be used to evaluate the 
efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor 
the treatment of an individual patient 

In order to provide a basis for the diagnosis of a disorder associated with expression of 

15 TRICH, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or 
amplification Standard hybridization may be quantified by comparing the values obtained from normal 
subjects with values from an experiment in which a known amount of a substantially purified 

20 polynucleotide is used. Standard values obtained in this manner may be compared with values 

obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in die 

25 patient begins to approximate that which is observed in the normal subject The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
30 of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 
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Additional diagnostic uses for oligonucleotides designed from the sequences encoding TRICH 
may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide 
encoding TRICH, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
5 TRICH, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 

10 substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding TRICH are used to amplify DNA using the 
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 

15 tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the 

secondary and tertiary structures of PCR products in single-stranded form, and these differences are 
detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are 
fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such as 
DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 

20 (isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 

overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the 

25 high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of TRICH include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C et aL (1993) J. Immunol. Methods 159:235-244; Duplaa, C 
et aL (1993) Anal Biochem. 212:229-236.) The speed of quantitation of multiple samples maybe 

30 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometry or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any f the 
polynucleotide sequences described herein may be used as elements on a microarray. The micro array 
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can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described below. The micro array may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
5 progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient For example, therapeutic agents which are highly effective and 
display the fewest side effects may be selected for a patient based on his/her pharmacogenomic 
10 profile. 

In another embodiment, TRICH, fragments of TRICK, or antibodies specific for TRICHmay 
be used as elements on a microarray. The micro array may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 

15 generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time. (See Seilhamer et aL, 4 'Comparative Gene Transcript Analysis," U.S. Patent No. 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 

20 hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

25 Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 

or other biological samples. The transcript image may thus reflect gene expression in vivo , as in the 
case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention 
may also be used in conjunction with in vitro model systems and preclinical evaluation of 

30 pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
(Nuwaysir, E.R et aL (1999) MoL Carcinog. 24:153-159; Steiner, S. and NX. Anderson (2000) 
ToxicoL Lett. 1 12-1 13 :467-47 1, expressly incorporated by reference herein). If a test compound has a 
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signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression 
provides the highest quality signature. Even genes whose expression is not altered by any tested 
5 compounds are important as well, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
10 example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2000, available at http^Aww.niehs.nikgov/oc^news/toxchip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 

15 containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 

biological sample are hybridized with one or more probes specific to the polynucleotides of the present 
invention, so that transcript levels corresponding to the polynucleotides of the present invention may be 
quantified. The transcript levels in the treated biological sample are compared with levels in an 
untreated biological sample. Differences in the transcript levels between the two samples are 

20 indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 
can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 

25 analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating 
and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 
isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 

30 sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins 
are visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot 
is generally proportional to the level of the protein in the sample. The optical densities of equivalent!/ 
positioned protein spots from different samples, for example, from biological samples either treated or 
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untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass 
spectrometry. The identity of the protein in a spot may be determined by comparing its partial 
5 sequence, preferably of at least 5 contiguous amino acid residues, to the polypeptide sequences of the 
present invention. In some cases, further sequence data may be obtained for definitive protein 
identification. 

A proteomic profile may also be generated using antibodies specific for TRICH to quantify 
the levels of TRICH expression. In one embodiment, the antibodies are used as elements on a 

10 microarray, and protein expression levels are quantified by exposing the microarray to the sample and 
detecting the levels of protein bound to each array element (Luelring, A. et aL (1999) Anal. Biochem. 
270:103-111; Mendoze, L.G. et aL (1999) Biotechniques 27:778-788). Detection maybe performed by 
a variety of methods known in the art, for example, by reacting the proteins in the sample with a flriol- 
or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each 

15 array element 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level There is a poor 
correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18533-537), so proteome toxicant signatures maybe 

20 useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 

25 biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological sample. 
A difference in the amount of protein between the two samples is indicative of a toxic response to the 
test compound in the treated sample. Individual proteins are identified by sequencing the amino acid 
residues of the individual proteins and comparing these partial sequences to the polypeptides of the 

30 present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared 
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with the amount in an untreated biological sample. A difference in the amount of protein between the 
two samples is indicative of a toxic response to th test compound in the treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Biennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et aL (1996) Proc. Natl. Acad. 
5 Sci. USA 93:10614-10619; Baldeschweiler et aL (1995) PCT application W095/251116; Sbalon, D. et 
al. (1995) PCT application WO95/35505; Heller, RA. et al. (1997) Proc. NatL Acad. Sci. USA 
94:2150-2155; and Heller, M.J. et aL (1997) U.S. Patent No. 5,605,662.) Various types of 
microarrays are well known and thoroughly described in DNA Microarrays: A Practical Approach . 
M. Schena, ed. (1999) Oxford University Press, London, hereby expressly incorporated by reference. 

10 In another embodiment of the invention, nucleic acid sequences encoding TRICH may be 

used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences maybe used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 

15 chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 

region of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et aL (1997) Nat 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127434; and Trask, B J. (1991) Trends Genet 

20 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop 
genetic linkage maps, for example, which correlate the inheritance of a disease state with the 
inheritance of a particular chromosome region or restriction fragment length polymorphism (RFLP). 
(See, for example, Lander, E-S. and D. Botstein (1986) Proc. NatL Acad. Sci. USA 83:7353-7357.) 
Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 

25 map data. (See, e.g., Heinz-Ulrich, et al (1995) in Meyers, supra , pp. 965-968.) Examples of genetic 
map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between die location of the gene encoding TRICH on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

30 In situ hybridization of chromosomal preparations and physical mapping techniques, such as 

linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on die chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
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techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to llq22-23, any 
sequences mapping to that area may represent associated or regulatory genes for further investigation. 
(See, e.g., Gatti, R.A. et aL (1988) Nature 336:577-580.) Hze nucleotide sequence of the instant 
5 invention may also be used to detect differences in the chromosomal location due to translocation, 
inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drag 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 

10 solid support, borue on a cell surface, or located intracettularly. The formation of binding complexes 
between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (See, e.g., Geysen, et aL (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 

15 synthesized on a solid substrate. Hie test compounds are reacted with TRICH, or fragments thereof, 
and washed. Bound TRICH is then detected by methods well known in the art Purified TRICH can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support 

20 In another embodiment, one may use competitive drug screening assays in which neutralizing 

antibodies capable of binding TRICH specifically compete with a test compound for binding TRICH 

In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 

antigenic determinants with TRICH 

hi additional embodiments, the nucleotide sequences which encode TRICH may be used in 
25 any molecular biology techniques that have yet to be developed, provided the new techniques rely on 

properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 

description, utilize the present invention to its fullest extent The following embodiments are, therefore, 
30 to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way 

whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below and 
including U.S. Ser. No. 60/232,685, U.S. Ser. No. 60/234,842, U.S. Ser. No. 60/236,882, U.S. Ser. 
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, No. 607239,057, U.S. Ser. No. 607240,540, and U.S. Ser. No. 60/241 ,700 are expressly incorporated 
by reference herein. 

EXAMPLES 

5 I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the UFESEQ GOLD 
database (Incyte Genomics, Palo Alto CA) and shown in Table 4, column 5. Some tissues were 
homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in 
phenol or in a suitable mixture of denaturants, such as TREZOL (life Technologies), a monophasic 

10 solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was 

15 isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

20 libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 

UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (life Technologies), using 
the recommended procedures or similar methods known in the art (See, e.g., Ausubel, 1997, supra, 
units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 

25 appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 
1000 bp) using SEPHACRYL S1000, SBPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRUT plasmid (Stratagene), PSPORT1 plasmid (life Technologies), PCDNA2.1 plasmid 

30 (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), PCR2-TOPOTA (Invitrogen), PCMV- 
ICIS (Stratagene), or pINCY (Incyte Genomics, Palo Alto CA), or derivatives thereof. Recombinant 
plasmids were transformed into competent E. coli cells including XLl-Btoe, XLl-BhieMRF, or SOLR 
from Stratagene or DHSct, DH10B, or ElectroMAX DH10B from life Technologies. 
II. Isolation of cDNA Clones 
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Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
5 QIAWELL 8 Phis Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the RE.AX. PREP 
96 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
ml of distilled water and stored, with or without lyophilization, at 4°C 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) AnaL Biochem. 216:1-14). Host cell lysis and thermal 

10 cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 

3 84- well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

15 Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 

Sequencing reactions were processed using standard methods or high-throughput instrumentation such 
as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 

20 using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 

25 protocols and base calling software; or other sequence analysis systems known in the art Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ansubel, 
1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the 
techniques disclosed in Example Vm. 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 

30 vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 

programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBahk primate, rodent, mammalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, PRODOM, and hidden Markov model (HMM>based protein family 
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databases such. as PFAM. (HMM is a probabilistic approach which analyzes consensus primary 
structures of gene families. See, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 
The queries were performed using programs based on BLAST, FASTA, BLIMPS, and HMMER. 
The Incyte cDNA sequences were assembled to produce full length polynucleotide sequences. 
5 Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or 
Genscan-predicted coding sequences (see Examples IV and V) were used to extend Incyte cDNA 
assemblages to full length Assembly was performed using programs based on Phred, Phrap, and 
Consed, and cDNA assemblages were screened for open reading frames using programs based on 
GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive 

10 the corresponding full length polypeptide sequences. Alternatively, a polypeptide of the invention may 
begin at any of the methionine residues of the full length translated polypeptide. Full length polypeptide 
sequences were subsequently analyzed by querying against databases such as the GenBank protein 
databases (genpept), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and hidden 
Markov model (HMM>based protein family databases such as PFAM Full length polynucleotide 

15 sequences are also analyzed using MACDNASIS PRO software (Hitachi Software Engineering, 

South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide 
sequence alignments are generated using default parameters specified by the CLUSTAL algorithm as 
incorporated into the MEGAUGN muMsequence alignment program (DNASTAR), which also 
calculates the percent identity between aligned sequences. 

20 Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 

Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, die second 
column provides brief descriptions thereof, the third column presents appropriate references, all of 
which are incorporated by reference herein in their entirety, and the fourth column presents, where 

25 applicable, the scores, probability values, and other parameters used to evaluate the strength of a 
match between two sequences (the higher the score or the lower the probability value, the greater the 
identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 

30 NO:27-52. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies are described in Table 4, column 4. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative transporters and ion channels were initially identified by running the Genscan gene 
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is 
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a general-purpose gene identification program which analyzes genomic DNA sequences from a 
variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. BioL 268:78-94, and Burge, C. and 
S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to 
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of 
5 Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range of 
sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan 
predicted cDNA sequences encode transporters and ion channels, the encoded polypeptides were 
analyzed by querying against PFAM models for transporters and ion channels. Potential transporters 
and ion channels were also identified by homology to Incyte cDNA sequences that had been 

10 annotated as transporters and ion channels. These selected Genscan-predicted sequences were then 
compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the 
Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to 
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST 
analysis was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted 

15 sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available, 
this information was used to correct or confirm the Genscan predicted sequence. Full length 
polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with 
Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in 
Example IH Alternatively, full length polynucleotide sequences were derived entirely from edited or 

20 unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 

25 III were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 
based on graph theory and dynamic programming to integrate cDNA and genomic information, 
generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
fiiB length sequence. Sequence intervals in which the entire length of the interval was present on 

30 more than one sequence in the cluster were identified, and intervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
thus identified were then "stitched" together by the stitching algorithm in the order that they appear 
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along their parent sequences to generate the longest possible sequence, as well as sequence variants, 
linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 

5 by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to the top BLAST hit from genpept Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA, when necessary. 
"Stretched" Sequences 

Partial DNA sequences were extended to foil length with an algorithm based on BLAST 

10 analysis. First, partial cDNAs assembled as described in Example HI were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 

15 (HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for homologous 
genomic sequences from the public human genome databases. Partial DNA sequences were 
therefore "stretched" or extended by the addition of homologous genomic sequences. The resultant 

20 stretched sequences were examined to determine whether it contained a complete gene. 
VL Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:27-52 were compared with 
sequences from the Incyte LDFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Wateiman algorithm. Sequences from these databases that matched 

25 SEQ ID NO:27-52 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap CTable 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 

30 of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
ann. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
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humans, although this can vary widely due to hot and cold spots of recombination.) Th cM distances 
are based on genetic markers mapped by Genethon which provide boundaries for radiation hybrid 
markers whose sequences were inchid d in each of the clusters. Human genome maps and other 
resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
5 (http:/Avww.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified disease 
genes map within or in proximity to the intervals indicated above. 

In this manner, SEQ ID NO:31 was mapped to chromosome 1 within the interval from 133.00 
to 137.30 centiMorgans. SEQ ID NO:33 was mapped to chromosome 12 within the interval from 
120.50 to the q terminal, or more specifically, within the interval from 126.10 to 145.70 centiMorgans. 

10 VIL Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel 
(1995) supra, ch. 4 and 16.) 

15 Analogous computer techniques applying BLAST were used to search for identical or related 

molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations, In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

20 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
25 length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
30 gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
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other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the 
tissue sources from which they were derived. For example, some full length sequences are 
5 assembled, at least in part, with overlapping Incyte cDNA sequences (see Example HI). Each cDNA 
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is 
classified into one of the following organ/tissue categories: cardiovascular system; connective tissue; 
digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, 
male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous system; 

10 pancreas; respiratory system; sense organs; skin; stomato gnathic system; unclassified/mixed; or 

urinary tract The number of libraries in each category is counted and divided by the total number of 
libraries across all categories. Similarly, each human tissue is classified into one of the following 
disease/condition categories: cancer, cell line, developmental, inflammation, neurological, trauma, 
cardiovascular, pooled, and other, and die number of libraries in each category is counted and divided 

15 by the total number of libraries across all categories. The resulting percentages reflect the tissue- and 
disease-specific expression of cDNA encoding TRICH. cDNA sequences and cDNA library/tissue 
information are found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA). 
VIII. Extension of TRICH Encoding Polynucleotides 

Fall length polynucleotide sequences were also produced by extension of an appropriate 

20 fragment of the foil length molecule using oligonucleotide primers designed from this fragment One 
primer was synthesized to initiate 5 ' extension of the known fragment, and the other primer was 
synthesized to initiate 3 ' extension of the known fragment The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 

25 sequence at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR using methods well known in the art PCR 

30 was performed in 96-wefl plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH^SC^, 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE 
enzyme (life Technologies), and Pfii DNA polymerase (Stratagene), with the following parameters 
for primer pair PCI A and PCI B: Step 1 : 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; 



77 



WO 02/22684 



PCT/US01/28938 



Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage 
at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94 °C, 
3 min; Step 2: 94°C, 15 sec; Step 3: 57 °C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 
repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. 
5 The concentration of DNA in each well was determined by dispensing 100 /il PICOGREEN 

quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 /zl of undiluted PGR product into each well of an opaque fluorimeter plate (Coining Costar, 
Acton MA), allowing the DNA to bind to the reagent The plate was scanned in a Ehioroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
10 concentration of DNA. A 5 a*1 to 10 iA aliquot of the reaction mixture was analyzed by 

electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-weH plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 

15 sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on bw concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 
(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction 

20 site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 

antibioticHX>ntaining media, and individual colonies were picked and cultured overnight at 37 °C in 384- 
weH plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

25 parameters: Step 1: 94°C, 3 min; Step 2: 94 °C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 
5: steps 2, 3, and 4 repeated 29 times; Step 6: 72 °C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted with 
20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 

30 primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, full length polynucleotide sequences are verified using the above procedure or 
are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides 
designed for such extension, and an appropriate genomic library. 
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IX* Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:27-52 are employed to screen cDNAs, 

genomic DNAs, or rnRNAs. Although the labeling of olig nucleotides, consisting of about 20 base 

pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
5 fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 

software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 jiCi of 

[Y- 32 j>] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 

(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
10 An aliquot containing 10 7 counts per minute of die labeled probe is used in a typical membrane-based 

hybridization analysis of human genomic DNA digested with one of the following endonucleases: Ase 

I, Bgl H, Eco RI, Pst I, Xba I, or Pvu H (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Phis, Schleicher & SchueH, Durham NH). Hybridization is carried out for 16 
15 hours at 40 °C. To remove'nonspecific signals, blots are sequentially washed at room temperature 

under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 

Hybridization patterns are visualized using autoradiography or an alternative imaging means and 

compared. 

X, Microarrays 

20 The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra.), mechanical 
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 
technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . 
Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a 

25 procedure analogous to a dot or slot blot may also be used to arrange and link elements to the surface 
of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may 
be produced using available methods and machines well known to those of ordinary skill in the art and 
may contain any appropriate number of elements. (See, e.g., Schena, M. et aL (1995) Science 
270:467-470; Shalon, D. et aL (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) 

30 Nat BiotechnoL 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGBNE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
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biological sampl are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
5 complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the micro array maybe assessed, hi one embodiment, microairay preparation and usage is described 
in detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

10 poly(A) + RNA is purified using the oligo-(dT) cellulose method. Bach poly(A) + RNA sample is 

reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/jd oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units//il RNase inhibitor, 500 /iM dATP, 500 fM dGTP, 500 flM dTTP, 40 /xM 
dCTP, 40 pM dCTP-Cy3 (EDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 

15 GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 

20 (CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended 
m!4/il5XSSC/0.2%SDS. 
Microarrav Preparation 

25 Sequences of the present invention are used to generate array elements. Each array element 

is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 jxg. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 

30 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Coining) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
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coated with 0.05% aminopropyl silane (Sigma) in 95% elhanoL Coated slides are cured in a 110°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 fjl of the array element DNA, at an average 
5 concentration of 100 ng//il, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Micro arrays are UV-crosslinked using a STRATALENKER UV-crosslinker (Stratagene). 
Micro arrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
10 buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 0.2% 
SDS and distilled water as before; 
Hybridization 

Hybridization reactions contain 9 /il of sample mixture consisting of 0.2 fig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

15 mixture is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 
fil of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 
6.5 hours at 60° C. The arrays are washed for 10 min at 45°C in a first wash buffer (IX SSC, 0.1% 

20 SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
hmova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 

25 focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

30 Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

Hamamatsu Photonics Systems, Bridgewater NI) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
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typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
5 the array contains a complementary DNA sequence, allowing the intensity of the signal at that location 
to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples from 
different sources (e.g., representing test and control cells), each labeled with a different fluorophore, 
are hybridized to a single array for the purpose of identifying genes that are differentially expressed, 
die calibration is done by labeling samples of the calibrating cDNA with the two ftuorophores and 

10 adding identical amounts of each to die hybridization mixture. 

The output of the photomuMplier tube is digitized using a 12-bit RTT-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 

15 signal). The data is also analyzed quantitatively. Where two different fluorophpres are excited and 
measured simultaneously, the data are first collected for optical crosstalk (due to overlapping emission 
spectra) between the ftuorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 

20 to obtain a numerical value corresponding to the average intensity of the signal The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

XI. Complementary Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring TRICH Although use of 

25 oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 

procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OUGO 4.06 software (National Biosciences) and the coding sequence of TRICH To 
inhibit transcription, a complementary oligonucleotide is designed from the most unique S 7 sequence 
and used to prevent promoter binding to the coding sequence. To inhibit translation, a complementary 

30 oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding transcript 
XIL Expression of TRICH 

Expression and purification of TRICH is achieved using bacterial or virus-based expression 
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 



82 



WO 02/22684 



PCT/US01/28938 



transcription. Examples of such promoters include, but are not limited to, the trp-lac (tad) hybrid 
promoter and the T5 or T7 bacteriophag promoter in conjunction with the lac operator regulatory 
element Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TRICH upon induction with isopropyl beta-D- 
5 fhiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding TRICH by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
10 polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, B.K. et 
aL (1994) Proc. Natl Acad. Sci. USA 91:3224-3227; Sandig, V. et al (1996) Hum. Gene Ther. 
7:1937-1945.) 

15 In most expression systems, TRICH is synthesized as a fusion protein with, e.g. , glutathione 

S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-ltilodalton 
enzyme from Schistosoma iaponicum, enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity ( Amersham Pharmacia 

20 Biotech). Following purification, the GST moiety can be proteolytically cleaved from TRICH at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables iromunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 

25 ch. 10 and 16). Purified TRICH obtained by these methods can be used directly in the assays shown 
in Examples XVI, XVII, and XVIII where applicable. 
Xm. Functional Assays 

TRICH function is assessed by expressing the sequences encoding TRICH at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 

30 vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include PCMV SPORT (Life Technologies) and PCR3.1 (Ihvitrogen, Carlsbad CA), both of which 
contain the cytomegalovirus promoter. 5-10 fig of recombinant vector are transiently transfected into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1-2 \i% of an additional plasmid containing sequences encoding a 
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marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
5 based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 
changes in cell size and granularity as measured by forward light scatter and 90 degree side light 

10 scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular proteins as measured by reactivity with 
specific antibodies; and alterations in plasma membrane composition as measured by the binding of 
ftuorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Qrmerod, M.G. (1994) Flow Cytometry , Oxford, New York NY. 

15 The influence of TRICH on gene expression can be assessed using highly purified populations 

of cells transfected with sequences encoding TRICH and either GD64 or CD64-GEP. CD64 and 
CD64-GEP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success 

20 NY). mRNA can be purified from the cells using methods well known by those of skill in the art 
Expression of mRNA encoding TRICH and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 
XIV, Production of TRICH Specific Antibodies 

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

25 Harrington, M.G. (1990) Methods EnzymoL 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art Methods for 

30 selection of appropriate epitopes, such as those near the C-terminus or inhydrophilic regions are well 
described in the art (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 43 1A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysucxiniirude ester (MBS) to 
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increase immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the 
Bgopeptide-KLH complex in complete Freund's adjuvant Resulting antisera are tested for 
antipeptide and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat 
5 anti-rabbit IgG. 

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies 
Naturally occurring or recombinant TRICH is substantially purified by imrnunoaffinity 

chromatography using antibodies specific for TRICH An imrnunoaffinity column is constructed by 
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as 

10 CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing TRICH are passed over the imrnunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 

15 antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and TRICH is collected. 

XVI. Identification of Molecules Which Interact with TRICH 

Molecules which interact with TRICH may include transporter substrates, agonists or 
antagonists, modulatory proteins such as Gffy proteins (Reimann, supra") or proteins involved in TRICH 

20 localization or clustering such as MAGUKs (Craven, supra) . TRICH, or biologically active fragments 
thereof, are labeled with m I Bolton-Hunter reagent (See, e.g., Bolton A.E. and W.M. Hunter (1973) 
Eiochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a multi-well plate 
are incubated with the labeled TRICH, washed, and any wells with labeled TRICH complex are 
assayed. Data obtained using different concentrations of TRICH are used to calculate values for the 

25 number, affinity, and association of TRICH with the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 
(Helds, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as 
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins arc 
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion 

30 protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute a 
transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems are 
commercially available, and methods for use of the yeast 2-hybrid system with ion channel proteins 
are discussed in Niethammer, M. and M. Sheng (1998, Meth Enzymol 293:104-122). 
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TRICK may also be used in the PATHCALLJNG process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 

5 Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH 

ion channel activity using the assays described in section XVIII. 
XVII. Demonstration of TRICH Activity 

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 
conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa 

10 or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are 
commercially available, and the techniques to introduce them into cells are well known to those skilled 
in the art A second plasmid which expresses any one of a number of marker genes, such as B- 
galactosidase, is co-transformed into the cells to allow rapid identification of those cells which have 
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after 

15 transformation under conditions appropriate for the cell line to allow expression and accumulation of 
TRICH and B-galactosidase. 

Transformed cells expressing B-galactosidase are stained blue when a suitable colorimetric 
substrate is added to the culture media under conditions that are well known in the art Stained cells 
are tested for differences in membrane conductance by electrophysiological techniques that are well 

20 known in the art Untransformed cells, and/or cells transformed with either vector sequences alone or 
B-galactosidase sequences alone, are used as controls and tested in parallel Cells expressing TRICH 
will have higher anion or cation conductance relative to control cells. The contribution of TRICH to 
conductance can be confirmed by incubating the cells using antibodies specific for TRICH The 
antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in the ion channel, 

25 and the associated conductance. 

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH- 
containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi et 
aL, supra ; Jegja, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an 
appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into 

30 mature stage IV oocytes. Injected oocytes are incubated at 18 °C for 1-5 days. Inside-out 

macropatches are excised into an intracellular solution containing 116 mM K-ghiconate, 4 mM KC1, 
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations of 
th TRICH mediator, such as cAMP, cGMP, or Ca +2 (in the form of CaCy, where appropriate. 
Electrode resistance is set at 2-5 M£2 and electrodes are filled with the intracellular solution lacking 
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mediator. Experiments are performed at room temperature from a holding potential of 0 mV. Voltage 
ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current 
measured is proportional to th activity of TRICH in the assay. In particular, the activity of TRICH- 
25 is measured as CI- conductance. 
5 Transport activity of TRICH is assayed by measuring uptake of labeled substrates into 

Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per 
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 2 , 
ImM MgQ 2 , ImM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , SOjigfaoI gentamycin, pH 7.8) to allow 
expression of TRICH. Oocytes are then transferred to standard uptake medium (lOOmM Nad, 2 

10 mM KC1, ImM CaQ 2 , ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., 
amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. 
radiolabeled with ^ fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 
30 minutes, uptake is terminated by washing the oocytes three times in NaMree medium, measuring 
the incorporated label, and comparing with controls. TRICH activity is proportional to the level of 

15 internalized labeled substrate. In particular, test substrates include amino acids for TRICH-1, xanthine 
and uracil for TRICH-3, mellbiose for TRICH-18, monocarboxylate for TRICH-20, neurotransmitters 
such as gamma-aminobutyric acid (GABA) for TRICH-22, and nucleosides for TRICH-23. 

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled ATP- 
[y- 32 P], separation of the hydrolysis products by chromatographic methods, and quantitation of the 

20 recovered 32 P using a scintillation counter. The reaction mixture contains ATP-fr- 32 ?] and varying 
amounts of TRICH in a suitable buffer incubated at 37 °C for a suitable period of time. The reaction 
is terminated by acid precipitation with trichloroacetic acid and then neutralized with base, and an 
aliquot of the reaction mixture is subjected to membrane or filter paper-based chromatography to 
separate the reaction products. The amount of 32 P liberated is counted in a scintillation counter. The 

25 amount of radioactivity recovered is proportional to the ATPase activity of TRICH in the assay. 
XVm. Identification of TRICH Agonists and Antagonists 

TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK 
(Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in the 
presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using 

30 patch clamp methods well known in the art or as described in Example XVII. Alternatively, ion 
channel activity is assayed using fluorescent techniques that measure ion flux across the cell 
membrane (Velicelebi, G. etaL (1999) Meth. EnzymoL 294:2047; West, MR. and CR. Molloy 
(1996) Anal. Biochem. 24151-58). These assays maybe adapted for high-throughput screening using 
microplates. Changes in internal i n concentration are measured using fluorescent dyes such as the 



87 



WO 02/22684 



PCT/US01/28938 



Ca 2+ indicator Fhio-4 AM, sodium-sensitive dyes such as SBH and sodium green, or the CI" indicator 
MQAE (all available fr m Molecular Probes) in combination with the ELEPR fluorimetric plate reading 
system (Molecular Devices). In a more generic version of this assay, changes in membrane potential 
caused by ionic fhix across the plasma membrane are measured using oxonyl dyes such as DiB AC 4 

5 (Molecular Probes). DiBAC 4 equilibrates between the extracellular solution and cellular sites 
according to the cellular membrane potential. The dye's fluorescence intensity is 20-fold greater 
when bound to hydrophobic intracellular sites, allowing detection of DiBAC 4 entry into the cell 
(Gonzalez, IE. and P.A. Negulescu (1998) Curr. Opin. BiotechnoL 9:624-631). Candidate agonists or 
antagonists maybe selected from known ion channel agonists or antagonists, peptide libraries, or 

10 combinatorial chemical libraries. 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
15 understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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What is claimed is: 



1. An isolated polypeptide selected from the group consisting of: 
e a) a polypeptide comprising an amino acid sequence selected from the group consisting 
5 ofSEQIDNO:l-26, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-26, 

c) a biologically active fragment of a polypeptide having an amino acid sequence 
10 selected from the group consisting of SEQ ID NO:l-26, and 

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO:l»26. 



15 



2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l-26. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 



4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20' 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ED 

NO:27-52. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 . * 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 



30 9. A method of producing a polypeptide of claim 1, the method comprising: 

a) cuburing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 
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b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide has an amino acid sequence selected from 
the group consisting of SEQ ID NO:l-26. 

5 

11. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
10 consisting of SEQ ID NO:27-52, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ 
IDNO:27-52, 

c) a polynucleotide complementary to a polynucleotide of a), 

15 d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

20 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and 

25 which probe specifically hybridizes to said target polynucleotide, under conditions 

whereby a hybridization complex is formed between said probe and said target 
polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, die amount thereof. 

30 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 
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a) amplifying said target polynucleotide ox fragment thereof using polymerase chain 
reaction amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

5 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutical^ acceptable 
excipient 

18. A composition of claim 17, wherein the polypeptide has an amino acid sequence selected 
10 from the group consisting of SEQ ID NO:l-26. 

19. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment the composition of 
claim 17. 

15 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient 

22. A method for treating a disease or condition associated with decreased expression of 
25 functional TRICH, comprising administering to a patient in need of such treatment a composition of 

claim 21. 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

30 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 and 
a pharmaceutically acceptable excipient 
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25. A method for treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of claim 
5 1 , the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

10 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 
15 b) assessing the activity of the polypeptide of claim 1 in the presence of the test 

compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 

20 presence of the test compound is indicative of a compound that modulates the activity 

of the polypeptide of claim 1. 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

25 comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing die expression of the target polynucleotide in the presence of varying 
30 amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 
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b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 

5 polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 

10 indicative of toxicity of the test compound. 

30. A diagnostic test for a condition or disease associated with the expression of TRICH in a 
biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 1 1 , under conditions suitable 
15 for the antibody to bind the polypeptide and form an antibodytpolypeptide complex, 

and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

20 31. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(aV)2 fragment, or 
25 e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient 

33. A method of diagnosing a condition or disease associated with the expression of TRICH 
30 in a subject, comprising administering to said subject an effective amount of the composition of claim 

32. 

34. A composition of claim 32, wherein the antibody is labeled. 
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35. A method of diagnosing a condition or disease associated with the expression of TRICH 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
34. 



5 ■ 36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 

11, the method comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected 

from the group consisting of SEQ ID NO:l-26, or an immunogenic fragment thereof, 
under conditions to elicit an antibody response, 
10 b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which binds specifically to a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-26. 



15 37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 



39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
20 11, the method comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-26, or an immunogenic fragment thereof, 
under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

25 c) fusing the antibody producing cells with immortalized cells to form monoclonal 

antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which binds specifically to a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ 

30 IDNO:l-26. 



40. A monoclonal antibody produced by a method of claim 39. 



41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 
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42. The antibody of claim 1 1, wherein the antibody is produced by screening a Fab expressi n 

library. 



43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-26 in a sample, the method comprising: 

a) incubating the antibody of claim 11 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO:l-26 in the sample. 

45. A method of purifying a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-26 from a sample, the method comprising: 

a) incubating the antibody of claim 11 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide having 
an amino acid sequence selected from die group consisting of SEQ ID NO: 1-26. 

46. A microarray wherein at least one element of the microarray is a polynucleotide of claim 

13. 

47. A method of generating a transcript image of a sample which contains polynucleotides, 
the method comprising: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled polynucleotides 
of the sample under conditions suitable for the formation of a hybridization complex, 
and 

c) quantifying the expression of the polynucleotides in the sample. 

48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide 
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or polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
polynucleotide, and wherein said target polynucleotide is a polynucleotide of claim 12. 



49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
5 completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

10 5 1. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 

completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

15 53. An array of claim 48, further comprising said target polynucleotide hybridized to a 

nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

20 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules having a sequence which differs from the sequence of nucleotide molecules at 

25 another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

30 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 



128 



WO 02/22684 PCT/US01/28938 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

61. A polypeptide of claim 1, comprising the amino acid sequence fSEQIDNO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ IP NO: 10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:ll. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

129 
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77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:27. 

83. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:28. 
15 84. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:29. 

85. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:30. 

86. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:3 1. 

20 

87. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:32. 

88. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:33. 
25 89 . A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:34. 

90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:35. 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:36. 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:37. 

93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:38. 



30 
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94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:39. 

95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:40. 
5 96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:41. 

97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ IDNO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:43. 

10 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:44. 

100. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:45. 

15 

101. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:46. 

102. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

20 NO:47. 

103. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:48. 

25 104. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:49. 

105. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO.50. 

30 

106. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

N051. 
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107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:52. 
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<110> INCYTE GENOMICS, INC. 
LEE, Ernestine A. 
YUE, Henry 
LAL, Preeti G. 
WALIA, Narinder K. 
BAUGHN, Mariah R. 
WARREN, Bridget A. 
LEE, Sally 
SANJANWALA, Madhu S. 
YAO, Monique G. 
RAMKUMAR, Jayalaxtai 
THORNTON, Michael 
GANDHI, Ameena R. 
POLICKY, Jennifer L. 
ELLIOTT, Vicki S. 
ARVIZU, Chandra 
RAUMANN, Brigitte E. 
BRUNS, Christopher M. 
NAINA, Amir 
HAFALIA, April J. A. 
NGUYEN, Danniel B. 
XU, Yuming 
LU, Dyung Aina M. 
ISON, Craig H. 
GRIFFIN, Jennifer A. 
REDDY, Roopa M. 
BURFORD, Neil 

<120> TRANSPORTERS AND ION CHANNELS 

<130> PI-0217 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/232,685; 60/234,842; 60/236,882; 60/239,057; 60/240,540; 
60/241,700 

<151> 2000-09-15; 2000-09-22; 2000-09-29; 2000-10-05; 2000-13-10; 
2000-10-18 * 

<160> 52 

<170> PERL Program 

<210> 1 
<211> 619 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1687189CD1 

<400> 1 . 

Met Pro Trp Gin Ala Phe Arg Arg Phe Gly Gin Lys Leu Val Arg 

15 10 15 

Arg Arg Thr Leu Glu Ser Gly Met Ala Glu Thr Arg Leu Ala Arg 
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20 25 30 

Cys Leu Ser Thr Leu Asp Leu Val Ala Leu Gly Val Gly Ser Thr 
35 40 45 

Leu Gly Ala Gly Val Tyr Val Leu Ala Gly Glu Val Ala Lys Asp 
50 55 60 

Lys Ala Gly Pro Ser lie Val He Cys Phe Leu Val Ala Ala Leu 
65 70 75 

Ser Ser Val Leu Ala Gly Leu Cys Tyr Ala Glu Phe Gly Ala Arg 
80 85 90 

Val Pro Arg Ser Gly Ser Ala Tyr Leu Tyr Ser Tyr Val Thr Val 
95 100 105 

Gly Glu Leu Trp Ala Phe Thr Thr Gly Trp Asn Leu He Leu Ser 

110 115 120 

Tyr Val He Gly Thr Ala Ser Val Ala Arg Ala Trp Ser Ser Ala 

125 130 135 

Phe Asp Asn Leu He Gly Asn His He Ser Lys Thr Leu Gin Gly 

140 145 150 

Ser He Ala Leu His Val Pro His Val Leu Ala Glu Tyr Pro Asp 

155 160 165 

Phe Phe Ala Leu Gly Leu Val Leu Leu Leu Thr Gly Leu Leu Ala 

170 175 180 

Leu Gly Ala Ser Glu Ser Ala Leu Val Thr Lys Val Phe Thr Gly 

185 190 195 

Val Asn Leu Leu Val Leu Gly Phe Val Met He Ser Gly Phe Val 

200 205 210 

Lys Gly Asp Val His Asn Trp Lys Leu Thr Glu Glu Asp Tyr Glu 

215 220 2-25 

Leu Ala Met Ala Glu Leu Asn Asp Thr Tyr Ser Leu Gly Pro Leu 

230 235 240 

Gly Ser Gly Gly Phe Val Pro Phe Gly Phe Glu Gly He Leu Arg 

245 250 255 

Gly Ala Ala Thr Cys Phe Tyr Ala Phe Val Gly Phe Asp Cys He 

260 265 270 

Ala Thr Thr Gly Glu Glu Ala Gin Asn Pro Gin Arg Ser He Pro 

275 280 285 

Met Gly He Val He Ser Leu Ser Val Cys Phe Leu Ala Tyr Phe 

290 295 300 

Ala Val Ser Ser Ala Leu Thr Leu Met Met Pro Tyr Tyr Gin Leu 

305 310 315 

Gin Pro Glu Ser Pro Leu Pro Glu Ala Phe Leu Tyr He Gly Trp 

320 325 330 

Ala Pro Ala Arg Tyr Val Val Ala Val Gly Ser Leu Cys Ala Leu 

335 340 345 

Ser Thr Ser Leu Leu Gly Ser Met Phe Pro Met Pro Arg Val lie 

350 355 360 

Tyr Ala Met Ala Glu Asp Gly Leu Leu Phe Arg Val Leu Ala Arg 

365 370 375 

He His Thr Gly Thr Arg Thr Pro He He Ala Thr Val Val Ser 

380 385 390 

Gly He He Ala Ala Phe Met Ala Phe Leu Phe Lys Leu Thr Asp 

395 400 405 

Leu Val Asp Leu Met Ser He Gly Thr Leu Leu Ala Tyr Ser Leu 

410 415 420 

Val Ser He Cys Val Leu He Leu Arg Tyr Gin Pro Asp Gin Glu 

425 430 435 

Thr Lys Thr, Gly Glu Glu Val Glu Leu Gin Glu Glu Ala He Thr 
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440 



445 



450 



Thr Glu Ser Glu Lys Leu Thr Leu Trp Gly Leu Phe Phe Pro Leu 

455 460 465 

Asn Ser He Pro Thr Pro Leu Ser Gly Gin He Val Tyr Val Cys 

470 475 480 

Ser Ser Leu Leu Ala Val Leu Leu Thr Ala Leu Cys Leu Val Leu 

485 490 495 

Ala Gin Trp Ser Val Pro Leu Leu Ser Gly Asp Leu Leu Trp Thr 

500 505 -510 

Ala Val Val Val Leu Leu Leu Leu Leu He He Gly He He Val 

515 520 525 

Val He Trp Arg Gin Pro Gin Ser Ser Thr Pro Leu His Phe Lys 

530 535 540 

Val Pro Ala Leu Pro Leu Leu Pro Leu Met Ser He Phe Val Asn 

545 550 555 

He Tyr Leu Met Met Gin Met Thr Ala Gly Thr Trp Ala Arg Phe 

560 565 570 

Gly Val Trp Met Leu He Gly Phe Ala He Tyr Phe Gly Tyr Gly 

575 580 585 

He Gin His Ser Leu Glu Glu He Lys Ser Asn Gin Pro Ser Arg 

590 595 600 

Lys Ser Arg Ala Lys Thr Val Asp Leu Asp Pro Gly Thr Leu Tyr 

605 610 615 

Val His Ser Val 



<210> 2 

<211> 2436 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7078207CD1 

<400> 2 

Met Gly Phe Leu His Gin Leu Gin Leu Leu Leu Trp Lys Asn Val 
15 10 15 

Thr Leu Lys Arg Arg Ser Pro Trp Val Leu Ala Phe Glu He Phe 

20 25 30 

He Pro Leu Val Leu Phe Phe He Leu Leu Gly Leu Arg Gin Lys 

35 40 45 

Lys Pro Thr He Ser Val Lys Glu Val Ser Phe Tyr Thr Ala Ala 

50 55 60 

Pro Leu Thr Ser Ala Gly He Leu Pro Val Met Gin Ser Leu Cys 

65 70 75 

Pro Asp Gly Gin Arg Asp Glu Phe Gly Phe Leu Gin Tyr Ala Asn 

80 85 90 

Ser Thr Val Thr Gin Leu Leu Glu Arg Leu Asp Arg Val Val Glu 

95 100 X05 

Glu Gly Asn Leu Phe Asp Pro Ala Arg Pro Ser Leu Gly Ser Glu 
110 115 120 

Leu Glu Ala Leu Arg Gin His Leu Glu Ala Leu Ser Ala Gly Pro 
125 130 135 

Gly Thr Ser Gly Ser His Leu Asp Arg Ser Thr Val Ser Ser Phe 



140 



145 



150 
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Ser Leu Asp 
Leu Thr Gin 
Leu Ala Ala 
Gly Pro Ser 
Gin Glu Pro 
Glu Glu Leu 
Thr Pro Gly 
Ser Gin Lys 
Gly Gin Ala 
Glu Leu Arg 
Gly Leu Asp 
Pro Pro Arg 
Gin Lys Val 
Leu Leu Pro 
Ala Ser Gly 
Val Met Gly 
Ala Ala Leu 
Phe Val Gin 
Asn Arg Thr 
Ser Leu Gly 
Val His Leu 
Gly Ser Glu 
Ala Phe Val 
lie Ser Ala 
Gin His Leu 
His Pro Glu 
Leu Arg Gin 
Gin Gin Leu 



Ser Val Ala 

155 
Asn Leu Ser 

170 

Arg Val Asp 

185 
Ser Ala Leu 

200 
Trp Ser Arg 

215 
Leu Leu Ala 

230 
Ser Gly Glu 

245 
Gly Ala Leu 

260 
Ala Ala Arg 

275 
Asn Gin Leu 

290 
Ala Pro Asn 

305 
Arg Leu Gin 

320 
Leu Gin Asp 

335 
Gin Gly Ala 

350 
Ala Gly Gly 

365 
Pro Asn Ala 

380 
Ala Thr Pro 

395 
Leu Trp Ala 

410 
lie Glu Pro 

425 
Phe Thr Ser 

440 
Met Thr Ser 

455 
Val Asp Arg 

470 
Gly Asn Val 

485 
Glu lie Arg 

500 
Arg Trp Leu 

515 

Ala Leu Asn 
530 

Asp Asn Phe 

545 
Asp Thr lie 

560 



Arg Asn Pro 
Leu Pro Asn 
Pro Pro Glu 
Asp Ser Gin 
Leu Gly Gly 
Pro Ala Leu 
Leu Gly Arg 
Gin Gly Tyr 
Ala Arg Arg 
Asp Val Ala 
Gly Ser Asp 
Ala Leu Leu 
Val Asp Val 
Cys Thr Gly 
Ala Ala Asn 
Thr Ala* Glu 
Asp Thr Leu 
Gly Leu Gin 
Glu Ala Leu 
Lys Glu Gin 
Asn Pro Lys 
Val lie Leu 
Thr His Tyr 
Ser Phe Leu 
Gin Gin Tyr 
Leu Ser Leu 
Ser Leu Pro 
Asp Asn Ala 



Gin Glu Leu 
160 

Ser Thr Ala 
175 

Val Tyr His 
190 

Ser Gly Leu 
205 

Asn Pro Leu 
220 

Leu Glu Gin 
235 

lie Leu Thr 
250 ' 

Arg Asp Ala 
265 

Phe Ser Gly 
280 

Lys Val Ser 
295 

Ser Ser Pro 
310 

Gly Asp Leu 
325 

Leu Ser Ala 
340 

Arg Thr Pro 
355 

Gly Thr Gly 
370 

Glu Gly Ala 
385 

Gin Gly Gin 
400 

Pro lie Leu 
415 

Arg Arg Gly 
430 

Arg Asn Leu 
445 

lie Leu Tyr 
460 

Lys Ala Asn 
475 

Ala Gin Val 
490 

Glu Gin Gly 
505 

Val Ala Glu 
520 

Asp Glu Leu 
535 

Ser Gly Met 
550 

Ala Cys Gly 
565 



Trp Arg Phe 
165 

Gin Ala Leu 
180 

Leu Leu Phe 
195 

His Lys Gly 
210 

Phe Arg Met 
225 

Leu Thr Cys 
240 

Val Pro Glu 
255 

Val Cys Ser 
270 

Leu Ser Ala 
285 

Gin Gin Leu 
300 

Gin Ala Pro 
315 

Leu Asp Ala 
330 

Leu Ala Leu 
345 

Gly Pro Pro 
360 

Ala Gly Ala 
375 

Pro Ser Ala 
390 

Cys Ser Ala 
405 

Cys Gly Asn 
420 

Asn Met Ser 
435 

Gly Leu Leu 
450 

Ala Pro Ala 
465 

Glu Thr Phe 
480 

Trp Leu Asn 
495 

Arg Leu Gin 
510 

Leu Arg Leu 
525 

Pro Pro Ala 
540 

Ala Leu Leu 
555 

Trp He Gin 
570 
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Phe Met Ser Lys Val Ser Val Asp lie Phe Lys Gly Phe Pro Asp 

575 580 585 

Glu Glu Ser lie Val Asn Tyr Thr Leu Asn Gin Ala Tyr Gin Asp 

590 595 600 

Asn Val Thr Val Phe Ala Ser Val lie Phe Gin Thr Arg Lys Asp 

605 610 615 

Gly Ser Leu Pro Pro His Val His Tyr Lys lie Arg Gin Asn Ser 

620 625 630 

Ser Phe Thr Glu Lys Thr Asn Glu He Arg Arg Ala Tyr Trp Arg 

635 640 645 

Pro Gly Pro Asn Thr Gly Gly Arg Phe Tyr Phe Leu Tyr Gly Phe 

650 655 660 

Val Trp He Gin Asp Met Met Glu Arg Ala He He Asp Thr Phe 

665 670 675 

Val Gly His Asp Val Val Glu Pro Gly Ser Tyr Val Gin Met Phe 

680 685 690 

Pro Tyr Pro Cys Tyr Thr Arg Asp Asp Phe Leu Phe Val He Glu 

695 700 705 

His Met Met Pro Leu Cys Met Val He Ser Trp Val Tyr Ser Val 

710 715 720 

Ala Met Thr He Gin His He Val Ala Glu Lys Glu His Arg Leu 

725 730 735 

Lys Glu Val Met Lys Thr Met Gly Leu Asn Asn Ala Val His Trp 

740 745 750 

Val Ala Trp Phe He Thr Gly Phe Val Gin Leu Ser He Ser Val 

755 760 765 

Thr Ala Leu Thr Ala He Leu Lys Tyr Gly Gin Val Leu Met His 

770 775 780 

Ser His Val Val He He Trp Leu Phe Leu Ala Val Tyr Ala Val 

785 790 795 

Ala Thr He Met Phe Cys Phe Leu Val Ser Val Leu Tyr Ser Lys 

800 805 810 

Ala Lys Leu Ala Ser Ala Cys Gly Gly He He Tyr Phe Leu Ser 

815 820 825 

Tyr Val Pro Tyr Met Tyr Val Ala He Arg Glu Glu Val Ala His 

830 835 840 

Asp Lys He Thr Ala Phe Glu Lys Cys He Ala Ser Leu Met Ser 

845 850 855 

Thr Thr Ala Phe Gly Leu Gly Ser Lys Tyr Phe Ala Leu Tyr Glu 

860 865 870 

Val Ala Gly Val Gly He Gin Trp His Thr Phe Ser Gin Ser Pro 

875 880 885 

Val Glu Gly Asp Asp Phe Asn Leu Leu Leu Ala Val Thr Met Leu 

890 895 900 

Met Val Asp Ala Val Val Tyr Gly He Leu Thr Trp Tyr He Glu 

905 910 915 

Ala Val His Pro Gly Met Tyr Gly Leu Pro Arg Pro Trp Tyr Phe 

920 925 930 

Pro Leu Gin Lys Ser Tyr Trp Leu Gly Ser Gly Arg Thr Glu Ala 

935 940 945 

Trp Glu Trp Ser Trp Pro Trp Ala Arg Thr Pro Arg Leu Ser Val 

950 955 960 

Met Glu Glu Asp Gin Ala Cys Ala Met Glu Ser Arg Arg Phe Glu 

965 970 $75 

Glu Thr Arg Gly Met Glu Glu Glu Pro Thr His Leu Pro Leu Val 

980 985 990 
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Val Cys Val Asp Lys Leu Thr Lys Val Tyr Lys Asp Asp Lys Lys 
995 1000 1005 

Leu Ala Leu Asn Lys Leu Ser Leu Asn Leu Tyr Glu Asn Gin Val 

1010 1015 1020 

Val Ser Phe Leu Gly His Asn Gly Ala Gly Lys Thr Thr Thr Met 

1025 1030 1035 

Ser lie Leu Thr Gly Leu Phe Pro Pro Thr Ser Gly Ser Ala Thr 

1040 1045 1050 

lie Tyr Gly His Asp lie Arg Thr Glu Met Asp Glu lie Arg Lys 

1055 1060 1065 

Asn Leu Gly Met Cys Pro Gin His Asn Val Leu Phe Asp Arg Leu 

1070 1075 1080 

Thr Val Glu Glu His Leu Trp Phe Tyr Ser Arg Leu Lys Ser Met 

1085 1090 1095 

Ala Gin Glu Glu He Arg Arg Glu Met Asp Lys Met He Glu Asp 

1100 1105 1110 

Leu Glu Leu Ser Asn Lys Arg His Ser Leu Val Gin Thr Leu Ser 

1115 1120 1125 

Gly Gly Met Lys Arg Lys Leu Ser Val Ala He Ala Phe Val Gly 

1130 1135 1140 

Gly Ser Arg Ala He He Leu Asp Glu Pro Thr Ala Gly Val Asp 

1145 ' 1150 1155 

Pro Tyr Ala Arg Arg Ala He Trp Asp Leu He Leu Lys Tyr Lys 

1160 1165 1170 

Pro Gly Arg Thr He Leu Leu Ser Thr His His Met Asp Glu Ala 

1175 1180 1185 

Asp Leu Leu Gly Asp Arg He Ala He lie Ser His Gly Lys Leu 

1190 1195 1200 

Lys Cys Cys Gly Ser Pro Leu Phe Leu Lys Gly Thr Tyr Gly Asp 

1205 1210 1215 

Gly Tyr Arg Leu Thr Leu Val Lys Arg Pro Ala Glu Pro Gly Gly 

1220 1225 1230 

Pro Gin Glu Pro Gly Leu Ala Ser Ser Pro Pro Gly Arg Ala Pro 

1235 1240 1245 

Leu Ser Ser Cys Ser Glu Leu Gin Val Ser Gin Phe He Arg Lys 

1250 1255 1260 

His Val Ala Ser Cys Leu Leu Val Ser Asp Thr Ser Thr Glu Leu 

1265 1270 1275 

Ser Tyr He Leu Pro Ser Glu Ala Ala Lys Lys Gly Ala Phe Glu 

1280 1285 1290 

Arg Leu Phe Gin His Leu Glu Arg Ser Leu Asp Ala Leu His Leu 

1295 1300 1305 

Ser Ser Phe Gly Leu Met Asp Thr Thr Leu Glu Glu Val Phe Leu 

1310 1315 1320 

Lys Val Ser Glu Glu Asp Gin Ser Leu Glu Asn Ser Glu Ala Asp 

1325 1330 1335 

Val Lys Glu Ser Arg Lys Asp Val Leu Pro Gly Ala Glu Gly Pro 

1340 1345 1350 

Ala Ser Gly Glu Gly His Ala Gly Asn Leu Ala Arg Cys Ser Glu 

1355 1360 1365 

Leu Thr Gin Ser Gin Ala Ser Leu Gin Ser Ala Ser Ser Val Gly 

1370 1375 1380 

Ser Ala Arg Gly Asp Glu Gly Ala Gly Tyr Thr Asp Val Tyr Gly 



Asp Tyr 



1385 
Arg Pro Leu 
1400 



Phe 



Asp 



Asn 



1390 
Pro Gin 
1405 



Asp 



Pro 



Asp 



1395 
Asn Val 
1410 
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Ser Leu Gin Glu Val Glu Ala Glu Ala Leu Ser Arg Val Gly Gin 

1415 1420 1425 

Gly Ser Arg Lys Leu Asp Gly Gly Trp Leu Lys Val Arg Gin Phe 

1430 1435 1440 

His Gly Leu Leu Val Lys Arg Phe His Cys Ala Arg Arg Asn Ser 

1445 1450 1455 

Lys Ala Leu Phe Ser Gin lie Leu Leu Pro Ala Phe Phe Val Cys 

1460 ' 1465 1470 

Val Ala Met Thr Val Ala Leu Ser Val Pro Glu He Gly Asp Leu 

1475 1480 1485 

Pro Pro Leu Val Leu Ser Pro Ser Gin Tyr His Asn Tyr Thr Gin 

1490 1495 1500 

Pro Arg Gly Asn Phe He Pro Tyr Ala Asn Glu Glu Arg Arg Glu 

1505 1510 1515 

Tyr Arg Leu Arg Leu Ser Pro Asp Ala Ser Pro Gin Gin Leu Val 

1520 1525 1530 

Ser Thr Phe Arg Leu Pro Ser Gly Val Gly Ala Thr Cys Val Leu 

1535 1540 1545 

Lys Ser Pro Ala Asn Gly Ser Leu Gly Pro Thr Leu Asn Leu Ser 

1550 1555 1560 

Ser Gly Glu Ser Arg Leu Leu Ala Ala Arg Phe Phe Asp Ser Met 

1565 1570 1575 

Cys Leu Glu Ser Phe Thr Gin Gly Leu Pro Leu Ser Asn Phe Val 



Asp Glu Asp Leu Gin Ala Trp Asn Val Ser Leu Pro Pro Thr Ala 

1610 1615 1620 

Gly Pro Glu Met Trp Thr Ser Ala Pro Ser Leu Pro Arg Leu Val 

1625 1630 1635 

Arg Glu Pro Val Arg Cys Thr Cys Ser Ala Gin Gly Thr Gly Phe 

1640 1645 1650 

Ser Cys Pro Ser Ser Val Gly Gly His Pro Pro Gin Met Arg Val 

1655 1660 1665 

Val Thr Gly Asp He Leu Thr Asp He Thr Gly His Asn Val Ser 

1670 1675 1680 

Glu Tyr Leu Leu Phe Thr Ser Asp Arg Phe Arg Leu His Arg Tyr 

1685 1690 1695 

Gly Ala He Thr Phe Gly Asn Val Leu Lys Ser He Pro Ala Ser 

1700 1705 1710 

Phe Gly Thr Arg Ala Pro Pro Met Val Arg Lys lie Ala Val Arg 

1715 1720 1725 

Arg Ala Ala Gin Val Phe Tyr Asn Asn Lys Gly Tyr His Ser Met 

1730 1735 1740 

Pro Thr Tyr Leu Asn Ser Leu Asn Asn Ala He Leu Arg Ala Asn 

1745 1750 1755 

Leu Pro Lys Ser Lys Gly Asn Pro Ala Ala Tyr Gly He Thr Val 

1760 1765 1770 

Thr Asn His Pro Met Asn Lys Thr Ser Ala Ser Leu Ser Leu Asp 

1775 1780 1785 

Tyr Leu Leu Gin Gly Thr Asp Val Val He Ala He Phe He He 

1790 1795 1800 

Val Ala Met Ser Phe Val Pro Ala Ser Phe Val Val Phe Leu Val 

1805 1810 1815 

Ala Glu Lys Ser Thr Lys Ala Lys His Leu Gin Phe Val Ser Gly 



Pro Pro Pro 



1580 
Pro Ser 
1595 



Pro Ala Pro 



1585 1590 
Ser Asp Ser Pro Ala Ser Pro 
1600 1605 



1820 



1825 



1830 
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Cve Aqti Pto Tl o Tit* 
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iy / d 


1 QOn 
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Trp Asp 
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i qo n 

iyy u 
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xy y b 


Met Ala val Glu Giy 


Val 


Val 


Gly 


Phe Leu 


Leu inr ne Met cys 


2000 








2005 


om n 


Gin Tyr Asn Phe Leu 


Arg Arg 


Pro 


Gin Arg 


Met Fro vai ber inr 


2015 








2020 


2025 


Lys Pro Val Glu Asp 


Asp 


Val 


Asp 


Val Ala 


Ser Glu Arg Gin Arg 


2030 








2035 


2040 


Val Leu Arg Gly Asp 


Ala 


Asp 


Asn 


Asp Met 


vai Lys lie Glu Asn 


2045 








2050 


once 


Leu Thr Lys Val Tyx 


Lys 


Ser Arg 


Lys He 


Gly Arg He Leu Ala 


2060 








2065 


2U /U 


Val Asp Arg Leu Cys 


Leu Gly Val 


Arg Pro 


Gly Glu Cys Phe Gly 










ZUoU 




Leu Leu Gly Val Asn 


Gly Ala Gly 


Lys Thr 


ber inr Fne iiys net 










i no c 
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Leu Thr Gly Asp Glu 
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Thr 


Thr 
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9 1 AC\ 
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A^*** T All 
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91 ^0 








91 1;<? 


91 fin 


JjVS nop ulll xxia nivj 


Val 


Val 


Lys 


IX L/ Ala 


Xtcll V?XU l^jfo XicU vrXH 


2165 








9i in 


917^ 


t.oti TVit* T.v*3 Tvt*~ Ala 

llCU 1 111 XJJfO A Jf J- 


Asp 


Lys 


Pro 


Ala m \/ 

AIR V7XV 


X 111 X jr 1 VJX jf V31_y 


2180 








91 

ulu J 


91 Qfl 
j&X J u 


A o T-k T.VC A"l**fT T.\7*3 T ,<ai 1 


Ser 


Thr 


Ala 


Tl A Ala 

lie Ala 


T.aii 7*1 o \7 THry" P>"rt 

lieu xxc vji_y iyi rxu 


91 OR 












Ala Phe lie Phe Leu 


Asp 


Glu 


Pro 


Thr Thr 


Gly Met Asp Pro Lys 


2210 








2215 


2220 


Ala Arg Arg Phe Leu 


Trp 


Asn 


Leu 


He Leu 


Asp Leu He Lys Thr 


2225 








2230 


2235 


Gly Arg Ser Val Val 


Leu 


Thr 


Ser 


His Ser 


Met Glu Glu Cys Glu 


2240 








2245 


2250 
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Ala Leu Cys Thr Arg Leu Ala He Met Val Asn Gly Arg Leu Arg 

2255 2260 2265 

Cys Leu Gly Ser He Gin His Leu Lys Asn Arg Phe Gly Asp Gly 

2270 2275 2280 

Tyr Met He Thr Val Arg Thr Lys Ser Ser Gin Ser Val Lys Asp 

2285 2290 2295 

Val Val Arg Phe Phe Asn Arg Asn Phe Pro Glu Ala Met Leu Lys 

2300 2305 2310 

Glu Arg His His Thr Lys Val Gin Tyr Gin Leu Lys Ser Glu His 

2315 2320 2325 

He Ser Leu Ala Gin Val Phe Ser Lys Met Glu Gin Val Ser Gly 

2330 2335 2340 

Val Leu Gly He Glu Asp Tyr Ser Val Ser Gin Thr Thr Leu Asp 

2345 2350 2355 

Asn Val Phe Val Asn Phe Ala Lys Lys Gin Ser Asp Asn Leu Glu 

2360 2365 2370 

Gin Gin Glu Thr Glu Pro Pro Ser Ala Leu Gin Ser Pro Leu Gly 

2375 2380 2385 

Cys Leu Leu Ser Leu Leu Arg Pro Arg Ser Ala Pro Thr Glu Leu 

2390 2395 2400 

Arg Ala Leu Val Ala Asp Glu Pro Glu Asp Leu Asp Thr Glu Asp 

2405 2410 2415 

Glu Gly Leu He Ser Phe Glu Glu Glu Arg Ala Gin Leu Ser Phe 

2420 2425 2430 

Asn Thr Asp Thr Leu Cys 

2435 



<210> 3 
<211> 610 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1560619CD1 



<400> 3 

Met Ser Arg Ser Pro Leu Asn Pro Ser Gin Leu Arg Ser Val Gly 

1 5 10 15 

Ser Gin Asp Ala Leu Ala Pro Leu Pro Pro Pro Ala Pro Gin Asn 

20 25 30 

Pro Ser Thr His Ser Trp Asp Pro Leu Cys Gly Ser Leu Pro Trp 

35 40 45 

Gly Leu Ser Cys Leu Leu Ala Leu Gin His Val Leu Val Met Ala 

50 55 60 

Ser Leu Leu Cys Val Ser His Leu Leu Leu Leu Cys Ser Leu Ser 

65 70 75 

Pro Gly Gly Leu Ser Tyr Ser Pro Ser Gin Leu Leu Ala Ser Ser 

80 85 90 

Phe Phe Ser Cys Gly Met Ser Thr He Leu Gin Thr Trp Met Gly 

95 100 105 

Ser Arg Leu Pro Leu Val Gin Ala Pro Ser Leu Glu Phe Leu He 

110 115 120 

Pro Ala Leu Val Leu Thr Ser Gin Lys Leu Pro Arg Ala He Gin 

125 130 135 

Thr Pro Gly Asn Ser Ser Leu Met Leu His Leu Cys Arg Gly Pro 
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140 


145 








150 


Ser Cys His 


Gly Leu 


Gly His Trp Asn Thr 


Ser 


Leu 


Gin 


Glu Val 




155 


160 








165 


Ser Gly Ala Val Val 


Val Ser Gly Leu Leu 


Gin Gly Met Met Gly 




170 


175 








180 


Leu Leu Gly Ser Pro 


Gly His Val Phe Pro 


His 


Cys Gly Pro Leu 




185 


190 








195 


Val Leu Ala 


Pro Ser 


Leu Val Val Ala Gly 


Leu 


Ser 


Ala 


His Arg 




200 


205 








210 


Glu Val Ala 


Gin Phe 


Cys Phe Thr His Trp 


Gly Leu Ala Leu Leu 




215 


220 








225 


Val He Leu 


Leu Met 


Val Val Cys Ser Gin 


His 


Leu Gly Ser Cys 




230 


235 








240 


Gin Phe His 


Val Cys 


Pro Trp Arg Arg Ala 


Ser 


Thr 


Ser 


Ser Thr 




245 


250 








255 


His Thr Pro 


Leu Pro 


Val Phe Arg Leu Leu 


Ser 


Val 


Leu 


He Pro 




260 


265 








270 


Val Ala Cys 


Val Trp 


He Val Ser Ala Phe 


Val 


Gly Phe Ser Val 




275 


280 








285 


He Pro Gin 


Glu Leu 


Ser Ala Pro Thr Lys 


Ala 


Pro 


Trp 


He Trp 




290 


295 








300 


Leu Pro His 


Pro Gly 


Glu Trp Asn Trp Pro 


Leu 


Leu 


Thr 


Pro Arg 




305 


310 








315 


Ala Leu Ala 


Ala Gly 


He Ser Met Ala Leu 


Ala 


Ala 


Ser 


Thr Ser 




320 


325 








330 


Ser Leu Gly Cys Tyr 


Ala Leu Cys Gly Arg 


Leu 


Leu 


His 


Leu Pro 




335 


340 








345 


Pro Pro Pro 


Pro His 


Ala Cys Ser Arg Gly 


Leu Ser Leu Glu Gly 




350 


355 








360 


Leu Gly Ser Val Leu 


Ala Gly Leu Leu Gly 


Ser Pro Met Gly Thr 




365 


370 








375 


Ala Ser Ser 


Phe Pro 


Asn Val Gly Lys Val 


Gly Leu 


He 


Gin Ala 




380 


385 








390 


Gly Ser Gin Gin Val 


Ala His Leu Val Gly 


Leu 


Leu 


Cys 


Val Gly 




395 


400 








405 


Leu Gly Leu 


Ser Pro 


Arg Leu Ala Gin Leu 


Leu 


Thr 


Thr 


He Pro 




410 


415 








420 


Leu Pro Val 


Val Gly 


Gly Val Leu Gly Val 


Thr 


Gin 


Ala 


Val Val 




425 


430 








435 


Leu Ser Ala Gly Phe 


Ser Ser Phe Tyr Leu 


Ala 


Asp 


He 


Asp Ser 




440 


445 








450 


Gly Arg Asn 


He Phe 


He Val Gly Phe Ser 


He 


Phe 


Met 


Ala Leu 




455 


460 








465 


Leu Leu Pro 


Arg Trp 


Phe Arg Glu Ala Pro 


Val 


Leu 


Phe 


Ser Thr 




470 


475 








480 


Gly Trp Ser 


Pro Leu 


Asp Val Leu Leu His 


Ser 


Leu 


Leu 


Thr Gin 




485 


490 








495 


Pro lie Phe 


Leu Ala 


Gly Leu Ser Gly Phe 


Leu 


Leu 


Glu 


Asn Thr 




500 


505 








510 


He Pro Gly Thr Gin 


Leu Glu Arg Gly Leu 


Gly Gin Gly Leu Pro 




515 


520 








525 


Ser Pro Phe 


Thr Ala 


Gin Glu Ala Arg Met 


Pro Gin Lys 


Pro Arg 




530 


535 








540 


Glu Lys Ala Ala Gin 


Val Tyr Arg Leu Pro 


Phe 


Pro 


He 


Gin Asn 




545 


550 








555 


Leu Cys Pro 


Cys He 


Pro Gin Pro Leu His 


Cys 


Leu Cys 


Pro Leu 
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560 



565 



570 



Pro Glu Asp Pro Gly Asp Glu Glu Gly Gly Ser Ser Glu Pro Glu 
575 580 585 

Glu Met Ala Asp Leu Leu Pro Gly Ser Gly Glu Pro Cys Pro Glu 
590 595 600 

Ser Ser Arg Glu Gly Phe Arg Ser Gin Lys 



<210> 4 
<211> 372 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2614283CD1 

<400> 4 

Met Glu Ala Lys Glu Lys Gin His Leu Leu Asp Ala Arg Pro Ala 
15 10 15 

lie Arg Ser Tyr Thr Gly Ser Leu Trp Gin Glu Gly Ala Gly Trp 
20 25 30 

lie Pro Leu Pro Arg Pro Gly Leu Asp Leu Gin Ala lie Glu Leu 
35 40 45 

Ala Ala Gin Ser Asn His His Cys His Ala Gin Lys Gly Pro Asp 
50 55 60 

Ser His Cys Asp Pro Lys Lys Gly Lys Ala Gin Arg Gin Leu Tyr 
65 70 75 

Val Ala Ser Ala He Cys Leu Leu Phe Met He Gly Glu Val Val 
80 85. 90 

Gly Gly Tyr Leu Ala His Ser Leu Ala Val Met Thr Asp Ala Ala 
95 100 105 

His Leu Leu Thr Asp Phe Ala Ser Met Leu He Ser Leu Phe Ser 

110 115 120 

Leu Trp Met Ser Ser Arg Pro Ala Thr Lys Thr Met Asn Phe Gly 

125 130 135 

Trp Gin Arg Ala Glu He Leu Gly Ala Leu Val Ser Val Leu Ser 

140 145 150 

He Trp Val Val Thr Gly Val Leu Val Tyr Leu Ala Val Glu Arg 

155 160 165 

Leu He Ser Gly Asp Tyr Glu He Asp Gly Gly Thr Met Leu He 

170 175 180 

Thr Ser Gly Cys Ala Val Ala Val Asn He He Met Gly Leu Thr 

185 190 195 

Leu His Gin Ser Gly His Gly His Ser His Gly Thr Thr Asn Gin 

200 205 210 

Gin Glu Glu Asn Pro Ser Val Arg Ala Ala Phe He His Val He 

215 220 225 

Gly Asp Phe Met Gin Ser Met Gly Val Leu Val Ala Ala Tyr He 

230 235 240 

Leu Tyr Phe Lys Pro Glu Tyr Lys Tyr Val Asp Pro He Cys Thr 

245 250 255 

Phe Val Phe Ser He Leu Val Leu Gly Thr Thr Leu Thr He Leu 

260 265 270 

Arg Asp Val He Leu Val Leu Met Glu Gly Thr Pro Lys Gly Val 



605 



610 



275 



280 



285 
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Asp Phe Thr Ala Val Arg Asp 
290 



Leu Leu Leu Ser Val Glu Gly Val 
295 300 



Glu Ala Leu His Ser Leu His He Trp Ala Leu Thr Val Ala Gin 
305 310 315 

Pro Val Leu Ser Val His He Ala He Ala Gin Asn Thr Asp Ala 
320 325 330 

Gin Ala Val Leu Lys Thr Ala Ser Ser Arg Leu Gin Gly Lys Phe 
335 340 345 

His Phe His Thr Val Thr He Gin He Glu Asp Tyr Ser Glu Asp 



<210> 5 

<211> 490 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2667691CD1 

<400> 5 . 

Met Thr Gin Gly Lys Lys Lys Lys Arg Ala Ala Asn Arg Ser He 
1 5 10 15 

Met Leu Ala Lys Lys He He He Lys Asp Gly Gly Thr Pro Gin 
20 25 30 

Gly He Gly Ser Pro Ser Val Tyr His Ala Val He Val He Phe 
35 40 45 

Leu Glu Phe Phe Ala Trp Gly Leu Leu Thr Ala Pro Thr Leu Val 
50 55 60 

Val Leu His Glu Thr Phe Pro Lys His Thr Phe Leu Met Asn Gly 
65 . 70 75 

Leu He Gin Gly Val Lys Gly Leu Leu Ser Phe Leu Ser Ala Pro 
80 85 90 

Leu He Gly Ala Leu Ser Asp Val Trp Gly Arg Lys Ser Phe Leu 
95 100 105 

Leu Leu Thr Val Phe Phe Thr Cys Ala Pro He Pro Leu Met Lys 
110 115 120 

He Ser Pro Trp Trp Tyr Phe Ala Val He Ser Val Ser Gly Val 
125 130 135 

Phe Ala Val Thr Phe Ser Val Val Phe Ala Tyr Val Ala Asp He 
140 145 150 

Thr Gin Glu His Glu Arg Ser Met Ala Tyr Gly Leu Val Ser Ala 
155 * 160 165 

Thr Phe Ala Ala Ser Leu Val Thr Ser Pro Ala He Gly Ala Tyr 
170 175 180 

Leu Gly Arg Val Tyr Gly Asp Ser Leu Val Val Val Leu Ala Thr 
185 190 195 

Ala He Ala Leu Leu Asp He Cys Phe He Leu Val Ala Val Pro 
200 205 210 

Glu Ser Leu Pro Glu Lys Met Arg Pro Ala Ser Trp Gly Ala Pro 
215 220 225 

He Ser Trp Glu Gin Ala Asp Pro Phe Ala Ser Leu Lys Lys Val 



350 

Met Lys Asp Cys Gin Ala Cys 
365 



355 

Gin Gly Pro Ser Asp 
370 



360 



230 

Gly Gin Asp Ser He Val Leu 



235 240 
Leu He Cys He Thr Val Phe Leu 
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245 250 255 

Ser Tyr Leu Pro Glu Ala Gly Gin Tyr Ser Ser Phe Phe Leu Tyr 

260 265 270 

Leu Arg Gin lie Met Lys Phe Ser Pro Glu Ser Val Ala Ala Phe 

275 280 285 

He Ala Val Leu Gly He Leu Ser He He Ala Gin Thr He Val 

290 295 300 

Leu Ser Leu Leu Met Arg Ser He Gly Asn Lys Asn Thr He Leu 

305 310 315 

Leu Gly Leu Gly Phe Gin He Leu Gin Leu Ala Trp Tyr Gly Phe 

320 325 330 

Gly Ser Glu Pro Trp Met Met Trp Ala Ala Gly Ala Val Ala Ala 

335 340 345 

Met Ser Ser He Thr Phe Pro Ala Val Ser Ala Leu Val Ser Arg 

350 355 360 

Thr Ala Asp Ala Asp Gin Gin Gly Val Val Gin Gly Met He Thr 

365 370 375 

Gly He Arg Gly Leu Cys Asn Gly Leu Gly Pro Ala Leu Tyr Gly 

380 385 390 

Phe He Phe Tyr He Phe His Val Glu Leu Lys Glu Leu Pro He 

395 400 405 

Thr Gly Thr Asp Leu Gly Thr Asn Thr Ser Pro Gin His His Phe 

410 415 420 

Glu Gin Asn Ser He He Pro Gly Pro Pro Phe Leu Phe Gly Ala 

425 430 435 

Cys Ser Val Leu Leu Ala Leu Leu Val Ala Leu Phe He Pro Glu 

440 445 450 

His Thr Asn Leu Ser Leu Arg Ser Ser Ser Trp Arg Lys His Cys 

455 460 465 

Gly Ser His Ser His Pro His Asn Thr Gin Ala Pro Gly Glu Ala 

470 475 480 

Lys Glu Pro Leu Leu Gin Asp Thr Asn Val 

485 490 



<210> 6 

<211> 377 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3211415CD1 



<400> 6 

Met Leu Pro Leu Ser He Lys Asp Asp Glu Tyr Lys Pro Pro Lys 

1 5 10 15 

Phe Asn Leu Phe Gly Lys He Ser Gly Trp Phe Arg Ser He Leu 

20 25 30 

Ser Asp Lys Thr Ser Arg Asn Leu Phe Phe Phe Leu Cys Leu Asn 

35 40 45 

Leu Ser Phe Ala Phe Val Glu Leu Leu Tyr Gly He Trp Ser Asn 

50 55 60 

Cys Leu Gly Leu He Ser Asp Ser Phe His Met Phe Phe Asp Ser 

65 70 75 

Thr Ala He Leii Ala Gly Leu Ala Ala Ser Val He Ser Lys Trp 

80 85 90 
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Arg Asp Asn Asp Ala Phe Ser Tyr Gly Tyr Val Arg Ala Glu Val 
95 100 105 

Leu Ala Gly Phe Val Asn Gly Leu Phe Leu lie Phe Thr Ala Phe 

110 115 120 

Phe lie Phe Ser Glu Gly Val Glu Arg Ala Leu Ala Pro Pro Asp 

125 130 135 

Val His His Glu Arg Leu Leu Leu Val Ser lie Leu Gly Phe Val 

140 145 150 

Val Asn Leu He Gly He Phe Val Phe Lys His Gly Gly His Gly 

155 160 165 

His Ser His Gly Ser Gly Gly His Gly His Ser His Ser Leu Phe 

170 175 180 

Asn Gly Ala Leu Asp Gin Ala His Gly His Val Asp His Cys His 

185 190 195 

Ser His Glu Val Lys His Gly Ala Ala His Ser His Asp His Ala 

200 205 210 

His Gly His Gly His Phe His Ser His Asp Gly Pro Ser Leu Lys 

215 220 225 

Glu Thr Thr Gly Pro Ser Arg Gin He Leu Gin Gly Val Phe Leu 

230 235 240 

His He Leu Ala Asp Thr Leu Gly Ser He Gly Val He Ala Ser 

245 250 255 

Ala He Met Met Gin Asn Phe Gly Leu Met He Ala Asp Pro He 

260 265 270 

Cys Ser He Leu He Ala He Leu He Val Val Ser Val He Pro 

275 280 285 

Leu Leu Arg Glu Ser Val Gly He Leu Met Gin Arg Thr Pro Pro 

290 295 300 

Leu Leu Glu Asn Ser Leu Pro Gin Cys Tyr Gin Arg Val Gin Gin 

305 310 315 

Leu Gin Gly Val Tyr Ser Leu Gin Glu Gin His Phe Trp Thr Leu 

320 325 330 

Cys Ser Asp Val Tyr Val Gly Thr Leu Lys Leu He Val Ala Pro 

335 340 345 

Asp Ala Asp Ala Arg Trp He Leu Ser Gin Thr His Asn He Phe 

350 355 360 

Thr Gin Ala Gly Val Arg Gin Leu Tyr Val Gin He Asp Phe Ala 

365 370 375 

Ala Met 



<210> 7 

<211> 340 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 4739923CD1 

<400> 7 

Met Ala Asp Thr Ala Thr Thr Ala 

1 5 
Ala Ser Ala Ser Ser Asp Ala Pro 
20 

Arg Phe Gin Gin Thr Ser Phe Tyr 



Ser Ala Ala Ala Ala Ser Ala 
10 15 

Pro Phe Gin Leu Gly Lys Pro 
25 30 

Gly Arg Phe Arg His Phe Leu 
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35 40 45 

Asp He He Asp Pro Arg Thr Leu Phe Val Thr Glu Arg Arg Leu 
50 55 60 

Arg Glu Ala Val Gin Leu Leu Glu Asp Tyr Lys His Gly Thr Leu 
65 70 75 

Arg Pro Gly Val Thr Asn Glu Gin Leu Trp Ser Ala Gin Lys He 
80 85 90 

Lys Gin Ala He Leu His Pro Asp Thr Asn Glu Lys He Phe Met 
95 100 105 

Pro Phe Arg Met Pro Gly Tyr He Pro Phe Gly Thr Pro He Val 

HO 115 120 

Val Gly Leu Leu Leu Pro Asn Gin Thr Leu Ala Ser Thr Val Phe 

125 130 135 

Trp Gin Trp Leu Asn Gin Ser His Asn Ala Cys Val Asn Tyr Ala 

140 145 150 

Asn Arg Asn Ala Thr Lys Pro Ser Pro Ala Ser Lys Phe He Gin 

155 160 165 

Gly Tyr Leu Gly Ala Val He Ser Ala Val Ser He Ala Val Gly 

170 175 180 

Leu Asn Val Leu Val Gin Lys Ala Asn Lys Leu Thr Pro Ala Thr 

185 190 195 

Arg Leu Leu He Gin Arg Phe Val Pro Phe Pro Ala Val Ala Ser 

200 205 210 

Ala Asn He Cys Asn Val Val Leu Met Arg Tyr Gly Glu Leu Glu 

215 220 225 

Glu Gly He Asp Val Leu Asp Ser Asp Gly Asn Leu Val Gly Ser 

230 235 240 

Ser Lys He Ala Ala Arg His Ala Leu Leu Glu Thr Ala Leu Thr 

245 250 255 

Arg Val Val Leu Pro Met Pro He Leu Val Leu Pro Pro He Val 

260 265 270 

Met Ser Met Leu Glu Lys Thr Ala Leu Leu Gin Ala Arg Pro Arg 

275 280 285 

Leu Leu Leu Pro Val Gin Ser Leu Val Cys Leu Ala Ala Phe Gly 

290 295 300 

Leu Ala Leu Pro Leu Ala He Ser Leu Phe Pro Gin Met Ser Glu 

305 310 315 

He Glu Thr Ser Gin Leu Glu Pro Glu He Ala Gin Ala Thr Ser 

320 325 330 

Ser Arg Thr Val Val Tyr Asn Lys Gly Leu 

335 340 

<210> 8 

<211> 1274 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 55030459CD1 

<400> 8 

Met Ala Arg Gin Pro Glu Glu Glu Glu Thr Ala Val Ala Arg Ala 
15 10 15 

Arg Arg Pro Pro Leu Trp Leu Leu Cys Leu Val Ala Cys Trp Leu 
20 -25 30 
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Leu Gly 


Aj.a ijj.y 


Ala 


Glu Ala Asp 


Phe 


Ser 


He Leu Asp Glu Ala 






o c 
OD 








40 


45 


CZl v-| TT a 1 


jjgu ax a 


Ser 


Gin Met Arg Arg 


Leu Ala Ala Glu Glu Leu 














55 


60 




vai inr 


Met 


Gin Arg lie 


Phe 


Asn 


Ser Phe Val Tyr Thr 






c c 
03 








70 


75 


Glu Lys 


lie ber 


Asn 


Gly Glu Ser Glu 


Val 


Gin Gin Leu Ala Lys 






80 








85 


90 


Lys lie 


Arg Glu 


Lys 


Phe Asn Arg Tyr 


Leu Asp Val Val Asn Arg 






95 








100 


105 


Asn Lys 


Gin Val 


Val 


Glu Ala Ser 


Tyr 


Thr 


Ala His Leu Thr Ser 






110 








115 


120 


Pro Leu 


Thr Ala 


He 


Gin Asp Cys 


Cys 


Thr 


He Pro Pro Ser Met 






125 








130 


135 


Met Glu 


Phe Asp Gly 


Asn 


Phe Asn 


Thr 


Asn 


Val Ser Arg Thr He 






140 








145 


150 


Ser Cys 


Asp Arg 


Leu 


Ser 


Thr Thr 


Val 


Asn Ser Arg Ala Phe Asn 






155 








160 


165 


Pro Gly 


Arg Asp 


Leu 


Asn 


Ser Val 


Leu 


Ala Asp Asn Leu Lys Ser 






170 








175 


180 


Asn Pro 


Gly lie 


Lys 


Trp Gin Tyr Phe 


Ser 


Ser Glu Glu Gly He 






185 








190 


195 


Phe Thr 


Val Phe 


Pro 


Ala 


His Lys 


Phe 


Arg Cys Lys Gly Ser Tyr 






200 








205 


210 


Glu His 


Arg Ser 


Arg 


Pro 


He Tyr 


Val 


Ser 


Thr Val Arg Pro Gin 






215 








220 


225 


Ser Lys 


His lie 


Val 


Val 


He Leu 


Asp 


His 


Gly Ala Ser Val Thr 






230 








235 


240 


Asp Thr 


Gin Leu 


Gin 


He 


Ala Lys 


Asp 


Ala 


Ala Gin Val He Leu 






245 








250 


255 


Ser Ala 


lie Asp Glu 


His 


Asp Lys 


He 


Ser 


Val Leu Thr Val Ala 






260 








265 


270 


Asp Thr 


Val Arg 


Thr 


Cys 


Ser Leu 


Asp 


Gin 


Cys Tyr Lys Thr Phe 






275 








280 


285 


Leu Ser 


Pro Ala 


Thr 


Ser 


Glu Thr 


Lys 


Arg 


Lys Met Ser Thr Phe 






290 








295 


300 


Val Ser 


Ser Val 


Lys 


Ser 


Ser Asp 


Ser 


Pro 


Thr Gin His Ala Val 






305 








310 


315 


Gly Phe 


Gin Lys Ala 


Phe 


Gin Leu 


He 


Arg 


Ser Thr Asn Asn Asn 






320 








325 


330 


Thr Lys 


Phe Gin 


Ala 


Asn Thr Asp 


Met 


Val 


He He Tyr Leu Ser 






335 








340 


345 


Ala Gly 


lie Thr 


Ser 


Lys Asp Ser 


Ser 


Glu Glu Asp Lys Lys Ala 






350 








355 


360 


Thr Leu 


Gin Val 


He 


Asn Glu Glu 


Asn 


Ser 


Phe Leu Asn Asn Ser 






365 








370 


375 


Val Met 


lie Leu 


Thr 


Tyr Ala Leu 


Met 


Asn Asp Gly Val Thr Gly 






380 








385 


390 


Leu Lys 


Glu Leu 


Ala 


Phe 


Leu Arg 


Asp 


Leu Ala Glu Gin Asn Ser 






395 








400 


405 


Gly Lys 


Tyr Gly Val 


Pro Asp Arg 


Thr 


Ala 


Leu Pro Val He Lys 






410 








415 


420 


Gly Ser 


Met Met 


Val 


Leu 


Asn Gin 


Leu 


Ser 


Asn Leu Glu Thr Thr 






425 








430 


435 


Val Gly 


Arg Phe 


Tyr 


Thr 


Asn Leu 


Pro 


Asn Arg Met He Asp Glu 






440 








445 


450 
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AX a Val irlie b€I lieu 


itjlo irne 


Ser Asp Glu 


net Vaxy Asp vjxy xitsu 










*± \j \j 




Tl o u a f> rpViv Car 
lit: L. 1 XIX Val DcX 




<-ys 


Tyr 


.trim 


vaXy Asn jjeu licU ucU 


^ / u 








<& 1 D 


Ann 


vjiy xxe val v^xy vai 


Asp vax 


Asn 


Leu 


Ala 


ryr xxe Leu v»xu Asp 


/toe; 








Aon 


AQfZ 


Val Thr Tyr Tyr Gin. 


Asp Ser 


Leu 


Ala 


Ser 


Tyr Thr Phe Leu He 


□ uu 










3X0 


Asp Asp Lys Gly Tyx 


Thr Leu 


wet 


nls 


Pro 


Ser Leu Thr Arg Pro 










con 


C *1 c 

525 


Tyx Leu Leu. Ser Glu 


Pro Pro 


Leu 


His 


Tnr 


Asp xxe xxe His Tyr 


con 










540 


Glu Asn He Pro Lys 


Phe Glu 


Leu 


Val 


Arg 


Gin Asn He Leu Ser 


545 








c c n 
DOG 


555 


Leu Pro Leu Gly Ser 


Gin He 


He 


Ala 


Val 


Pro Val Asn Ser Ser 


560 








565 


570 


Leu Ser Trp His He 


Asn Lys 


Leu Arg 


Glu 


Thr Gly Lys Glu Ala 


575 








580 


585 


Tyr Asn Val Ser Tyr 


Ala Trp 


Lys 


Met 


Val 


Gin Asp Thr Ser Phe 


590 








595 


600 


He Leu Cys He Val 


Val lie 


Gin 


Pro 


Glu 


He Pro Val Lys Gin 


605 








610 


615 


Leu Lys Asn Leu Asn 


Thr Val 


Pro 


Ser 


Ser 


Lys Leu Leu Tyr His 


620 








625 


630 


Arg Leu Asp Leu Leu 


Gly Gin 


Pro 


Ser 


Ala 


Cys Leu His Phe Lys 


635 








640 


645 


Gin Leu Ala Thr Leu 


Glu Ser 


Pro 


Thr 


He 


Met Leu Ser Ala Gly 


650 








655 


660 


Ser Phe Ser Ser Pro 


Tyr Glu 


His 


Leu 


Ser 


Gin Pro Glu Thr Lys 


665 








670 


675 


Arg Met Val Glu His 


Tyr Thr 


Ala 


Tyr 


Leu 


Ser Asp Asn Thr Arg 


680 








685 


690 


Leu He Ala Asn Pro 


Gly Leu 


Lys 


Phe 


Ser 


Val Arg Asn Glu Val 


695 








700 


705 


Met Ala Thr Ser His 


Val Thr 


Asp Glu Trp 


Met Thr Gin Met Glu 


710 








715 


720 


Met Ser Ser Leu Asn 


Thr Tyr 


He 


Val 


Arg 


Arg Tyr He Ala Thr 


725 








730 


735 


Pro Asn Gly Val Leu 


Arg He 


Tyr 


Pro Gly 


Ser Leu Met Asp Lys 


740 








745 


750 


Ala Phe Asp Pro Thr 


Arg Arg 


Gin 


Trp 


Tyr 


Leu His Ala Val Ala 


755 








760 


7ob 


Asn Pro Gly Leu He 


Ser Leu 


Thr Gly Pro 


Tyr Leu Asp Val Gly 


/ / 0 








775 


•7QA 


oxy Axa v»xy iyr vax 


vax xnr 


lie 


Ser 


His 


inr xxe nis oer £»er 


IOC 

78b 








790 


/y d 


Ser Thr -Gin Leu Ser 


Ser Gly 


His 


Thr 


Val 


AXa Vax Met cxy xxe 


O f\f\ 

800 








805 


Ql ft 

oXU 


Asp Phe Thr Leu Arg 


Tyr Phe 


Tyr Lys Val 


Leu Met Asp Leu Leu 


815 








820 


825 


Pro Val Cys Asn Gin 


Asp Gly 


Gly Asn Lys 


He Arg Cys Phe He 


830 








835 


840 


Met Glu Asp Arg Gly 


Tyr Leu 


Val 


Ala 


His 


Pro Thr Leu He Asp 


845 








850 


855 


pro Lys Gly His Ala 


Pro Val 


Glu 


Gin 


Gin 


His He Thr His Lys 


860 








865 


870 
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Glu Pro Leu Val Ala Asn Asp lie Leu Asn His Pro Asn Phe Val 
875 880 885 

Lys Lys Asn Leu Cys Asn Ser Phe Ser Asp Arg Thr Val Gin Arg 
890 895 900 

Phe Tyr Lys Phe Asn Thr Ser Leu Ala Gly Asp Leu Thr Asn Leu 
905 910 915 

Val His Gly Ser His Cys Ser Lys Tyr Arg Leu Ala Arg lie Pro 
920 925 930 

Gly Thr Asn Ala Phe Val Gly lie Val Asn Glu Thr Cys Asp Ser 
935 940 945 

Leu Ala Phe Cys Ala Cys Ser Met Val Asp Arg Leu Cys Leu Asn 
950 955 960 

Cys His Arg Met Glu Gin Asn Glu Cys Glu Cys Pro Cys Glu Cys 
965 970 975 

Pro Leu Glu Val Asn Glu Cys Thr Gly Asn Leu Thr Asn Ala Glu 
980 985 990 

Asn Arg Asn Pro Ser Cys Glu Val His Gin Glu Pro Val Thr Tyr 
995 1000 1005 

Thr Ala lie Asp Pro Gly Leu Gin Asp Ala Leu His Gin Cys Val 
1010 1015 1020 

Asn Ser Arg Cys Ser Gin Arg Leu Glu Ser Gly Asp Cys Phe Gly 
1025 1030 1035 

Val Leu Asp Cys Glu Trp Cys Met Val Asp Ser Asp Gly Lys Thr 
1040 1045 1050 

His Leu Asp Lys Pro Tyr Cys Ala Pro Gin Lys Glu Cys Phe Gly 
1055 1060 1065 

Gly He Val Gly Ala Lys Ser Pro Tyr Val Asp Asp Met Gly Ala 
1070 1075 1080 

He Gly Asp Glu Val He Thr Leu Asn Met He Lys Ser Ala Pro 
1085 1090 1095 

Val Gly Pro Val Ala Gly Gly He Met Gly Cys He Met Val Leu 
1100 1105 1110 

Val Leu Ala Val Tyr Ala Tyr Arg His Gin He His Arg Arg Ser 
1115 1120 1125 

His Gin His Met Ser Pro Leu Ala Ala Gin Glu Met Ser Val Arg 
1130 1135 1140 

Met Ser Asn Leu Glu Asn Asp Arg Asp Glu Arg Asp Asp Asp Ser 
1145 1150 -1155 

His Glu Asp Arg Gly He He Ser Asn Thr Arg Phe He Ala Ala 
1160, 1165 1170 

Val He Glu Arg His Ala His Ser Pro Glu Arg Arg Arg Arg Tyr 
1175 1180 H85 

Trp Gly Arg Ser Gly Thr Glu Ser Asp His Gly Tyr Ser Thr Met 
1190 1195 1200 

Ser Pro Gin Glu Asp Ser Glu Asn Pro Pro Cys Asn Asn Asp Pro 
1205 1210 1215 

Leu Ser Ala Gly Val Asp Val Gly Asn His Asp Glu Asp Leu Asp 
1220 1225 1230 

Leu Asp Thr Pro Pro Gin Thr Ala Ala Leu Leu Ser His Lys Phe 
1235 1240 1245 

His His Tyr Arg Ser His His Pro Thr Leu His His Ser His His 
1250 1255 1260 

Leu Gin Ala Ala Val Thr Val His Thr Val Asp Ala Glu Cys 
1265 1270 

<210> 9 
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<211> 595 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6113039CD1 



<400> 9 

Met Lys* Phe Phe Ser Tyr lie Leu Val Tyr Arg Arg Phe Leu Phe 
15 10 15 

Val Val Phe Thr Val Leu Val Leu Leu Pro Leu Pro lie Val Leu 
20 25 30 

His Thr Lys Glu Ala Glu Cys Ala Tyr Thr Leu Phe Val Val Ala 
35 40 45 

Thr Phe Trp Leu Thr Glu Ala Leu Pro Leu Ser Val Thr Ala Leu 
50 55 60 

Leu Pro Ser Leu Met Leu Pro Met Phe Gly He Met Pro Ser Lys 
65 70 75 

Lys Val Ala Ser Ala Tyr Phe Lys Asp Phe His Leu Leu Leu lie 
80 85 90 

Gly Val He Cys Leu Ala Thr Ser He Glu Lys Trp Asn Leu His 
95 100 105 

Lys Arg He Ala Leu Lys Met Val Met Met Val Gly Val Asn Pro 

110 115 120 

Ala Trp Leu Thr Leu Gly Phe Met Ser Ser Thr Ala Phe Leu Ser 

125 130 135 

Met Trp Leu Ser Asn Thr Ser Thr Ala Ala Met Val Met Pro He 

140 145 150 

Ala Glu Ala Val Val Gin Gin He He Asn Ala Glu Ala Glu Val ' 

155 160 165 

Glu Ala Thr Gin Met Thr Tyr Phe Asn Gly Ser Thr Ash His Gly 

170 175 180 

Leu Glu He Asp Glu Ser Val Asn Gly His Glu He Asn Glu Arg 

185 190 195 

Lys Glu Lys Thr Lys Pro Val Pro Gly Tyr Asn Asn Asp Thr Gly 

200 205 210 

Lys He Ser Ser Lys Val Glu Leu Glu Lys Asn Ser Gly Met Arg 

215 220 225 

Thr Lys Tyr Arg Thr Lys Lys Gly His Val Thr Arg Lys Leu Thr 

230 235 .240 

Cys Leu Cys He Ala Tyr Ser Ser Thr He Gly Gly Leu Thr Thr 

245 250 255 

He Thr Gly Thr Ser Thr Asn Leu He Phe Ala Glu Tyr Phe Asn 

260 265 270 

Thr Arg Tyr Pro Asp Cys Arg Cys Leu Asn Phe Gly Ser Trp Phe 

275 280 285 

Thr Phe Ser Phe Pro Ala Ala Leu He He Leu Leu Leu Ser Trp 

290 295 300 

He Trp Leu Gin Trp Leu Phe Leu Gly Phe Asn Phe Lys Glu Met 

305 310 315 

Phe Lys Cys Gly Lys Thr Lys Thr Val Gin Gin Lys Ala Cys Ala 

320 325 330 

Glu Val He Lys Gin Glu Tyr Gin Lys Leu Gly Pro He Arg Tyr 

335 340 345 

Gin Glu He Val Thr Leu Val Leu Phe He He Met Ala Leu Leu 
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350 355 360 

Trp Phe Ser Arg Asp Pro Gly Phe Val Pro Gly Trp Ser Ala Leu 

365 370 375 

Phe Ser Glu Tyr Pro Gly Phe Ala Thr Asp Ser Thr Val Ala Leu 

380 385 390 

Leu He Gly Leu Leu Phe Phe Leu He Pro Ala Lys Thr Leu Thr 

395 400 405 

Lys Thr Thr Pro Thr Gly Glu He Val Ala Phe Asp Tyr Ser Pro 

410 415 420 

Leu lie Thr Trp Lys Glu Phe Gin Ser Phe Met Pro Trp Asp He 

425 430 435 

Ala He Leu Val Gly Gly Gly Phe Ala Leu Ala Asp Gly Cys Glu 

440 445 450 

Glu Ser Gly Leu Ser Lys Trp He Gly Asn Lys Leu Ser Pro Leu 

455 . 460 465 

Gly Ser Leu Pro Ala Trp Leu He He Leu He Ser Ser Leu Met 

470 475 480 

Val Thr Ser Leu Thr Glu Val Ala Ser Asn Pro Ala Thr He Thr 

485 490 495 

Leu Phe Leu Pro He Leu Ser Pro Leu Ala Glu Ala He His Val 

500 505 510 

Asn Pro Leu Tyr He Leu He Pro Ser Thr Leu Cys Thr Ser Phe 

515 520 525 

Ala Phe Leu Leu Pro Val Ala Asn Pro Pro Asn Ala He Val Phe 

530 535 540 

Ser Tyr Gly His Leu Lys Val He Asp Met Val Lys Ala Gly Leu 

545 550 555 

Gly Val Asn He Val Gly Val Ala Val Val Met Leu Gly He Cys 

560 . 565 570 

Thr Trp He Val Pro Met Phe Asp Leu Tyr Thr Tyr Pro Ser Trp 

575 580 585 

Ala Pro Ala Met Ser Asn Glu Thr Met Pro 

590 595 

<210> 10 

<211> 475 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7101781CD1 

<400> 10 

Met Ser Pro Glu Val Thr Cys Pro Arg Arg Gly His Leu Pro Arg 

15 10 15 

Phe His Pro Arg Thr Trp Val Glu Pro Val Val Ala Ser Ser Gin 

20 25 30 

Val Ala Ala Ser Leu Tyr Asp Ala Gly Leu Leu Leu Val Val Lys 

35 40 45 

Ala Set Tyr Gly Thr Gly Gly Ser Ser Asn His Ser Ala Ser Pro 

50 55 60 

Ser Pro Arg Gly Ala Leu Glu Asp Gin Gin Gin Arg Ala He Ser 

65 70 75 

Asn Phe Tyr He He Tyr Asn Leu Val Val Gly Leu Ser Pro Leu 

80 85 90 
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<210> 11 
<211> 927 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7473036CD1 

<400> 11 

Met Gin Pro Ala Arg Gly Pro Leu Ala Ser Glu Pro Arg Thr Val 
15 10 15 

Leu Val Leu Arg Phe Cys Ala Ser Leu Met Glu Met Lys Leu Pro 
20 25 ' 30 

Gly Gin Glu Gly Phe Glu Ala Ser Ser Ala Pro Arg Asn lie Pro 
35 40 45 

Ser Gly Glu Leu Asp Ser Asn Pro Asp Pro Gly Thr Gly Pro Ser 
50 55 60 

Pro Asp Gly Pro Ser Asp Thr Glu Ser Lys Glu Leu Gly Val Pro 
65 70 75 

Lys Asp Pro Leu Leu Phe lie Gin Leu Asn Glu Leu Leu Gly Trp 
80 85 90 

Pro Gin Ala Leu Glu Trp Arg Glu Thr Gly Arg Trp Val Leu Phe 
95 100 105 

Glu Glu Lys Leu Glu Val Ala Ala Gly Arg Trp Ser Ala Pro His 

110 115 120 

Val Pro Thr Leu Ala Leu Pro Ser Leu Gin Lys Leu Arg Ser Leu 

125 130 135 

Leu Ala Glu Gly Leu Val Leu Leu Asp Cys Pro Ala Gin Ser Leu 

140 145 150 

Leu Glu Leu Val Gly Ser Thr His Pro Arg Lys Ala Ser Asp Asn 

155 160 165 

Glu Glu Ala Pro Leu Arg Glu Gin Cys Gin Asn Pro Leu Arg Gin 

170 175 180 

Lys Leu Pro Pro Gly Ala Glu Ala Gly Thr Val Leu Ala Gly Glu 

185 190 195 

Leu Gly Phe Leu Ala Gin Pro Leu Gly Ala Phe Val Arg Leu Arg 

200 205 210 

Asn Pro Val Val Leu Gly Ser Leu Thr Glu Val Ser Leu Pro Ser 

215 220 225 

Arg Phe Phe Cys Leu Leu Leu Gly Pro Cys Met Leu Gly Lys Gly 

230 235 240 

Tyr His Glu Met Gly Arg Ala Ala Ala Val Leu Leu Ser Asp Pro 

245 250 255 

Gin Phe Gin Trp Ser Val Arg Arg Ala Ser Asn Leu His Asp Leu 

260 265 270 

Leu Ala Ala Leu Asp Ala Phe Leu Glu Glu Val Thr Val Leu Pro 

275 280 285 

Pro Gly Arg Trp Asp Pro Thr Ala Arg lie Pro Pro Pro Lys Cys 

290 295 300 

Leu Pro Ser Gin His Lys Arg Leu Pro Ser Gin Gin Arg Glu He 

305 310 315 

Arg Gly Pro Ala Val Pro Arg Leu Thr Ser Ala Glu Asp Arg His 

320 325 330 

Arg His Gly Pro His Ala His Ser Pro Glu Leu Gin Arg Thr Gly 

335 340 345 

Ser Asp Phe Leu Asp Ala Leu His Leu Gin Cys Phe Ser Ala Val 

350 355 360 

Leu Tyr He Tyr Leu Ala Thr Val Thr Asn Ala He Thr Phe Gly 
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365 370 375 

Gly Leu Leu Gly Asp Ala Thr Asp Gly Ala Gin Gly Val Leu Glu 

380 385 390 

Ser Phe Leu Gly Thr Ala Val Ala Gly Ala Ala Phe Cys Leu Met 

395 400 405 

Ala Gly Gin Pro Leu Thr lie Leu Ser Ser Thr Gly Pro Val Leu 

410 415 420 

Val Phe Glu Arg Leu Leu Phe Ser Phe Ser Arg Asp Tyr Ser Leu 

425 430 435 

Asp Tyr Leu Pro Phe Arg Leu Trp Val Gly lie Trp Val Ala Thr 

440 445 450 

Phe Cys Leu Val Leu Val Ala Thr Glu Ala Ser Val Leu Val Arg 

455 460 465 

Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe Cys Ala Leu lie Ser 

470 , 475 480 

Leu lie Phe He Tyr Asp Ala Val Gly Lys Met Leu Asn Leu Thr 

485 490 495 

His Thr Tyr Pro He Gin Lys Pro Gly Ser Ser Ala Tyr Gly Cys 

500 505 510 

Leu Cys Gin Tyr Pro Gly Pro Gly Gly Asn Glu Ser Gin Trp He 

515 520 525 

Arg Thr Arg Pro Lys Asp Arg Asp Asp He Val Ser Met Asp Leu 

530 535 540 

Gly Leu He Asn Ala Ser Leu Leu Pro Pro Pro Glu Cys Thr Arg 

545 550 555 

Gin Gly Gly His Pro Arg Gly Pro Gly Cys His Thr Val Pro Asp 

560 565 570 

He Ala Phe Phe Ser Leu Leu Leu Phe Leu Thr Ser Phe Phe Phe 

575 580 585 

Ala Met Ala Leu Lys Cys Val Lys Thr Ser Arg Phe Phe Pro Ser 

590 595 600 

Val Val Arg Lys Gly Leu Ser Asp Phe Ser Ser Val Leu Ala He 

605 610 615 

Leu Leu Gly Cys Gly Leu Asp Ala Phe Leu Gly Leu Ala Thr Pro 

620 625 630 

Lys Leu Met Val Pro Arg Glu Phe Lys Pro Thr Leu Pro Gly Arg 

635 640 645 

Gly Trp Leu Val Ser Pro Phe Gly Ala Asn Pro Trp Trp Trp Ser 

650 655 660 

Val Ala Ala Ala Leu Pro Ala Leu Leu Leu Ser He Leu He Phe 

665 670 675 

Met Asp Gin Gin He Thr Ala Val He Leu Asn Arg Met Glu Tyr 

680 685 690 

Arg Leu Gin Lys Gly Ala Gly Phe His Leu Asp Leu Phe Cys Val 

695 700 705 

Ala Val Leu Met Leu Leu Thr Ser Ala Leu Gly Leu Pro Trp Tyr 

710 715 720 

Val Ser Ala Thr Val He Ser Leu Ala His Met Asp Ser Leu Arg 

725 730 735 

Arg Glu Ser Arg Ala Cys Ala Pro Gly Glu Arg Pro Asn Phe Leu 

740 745 750 

Gly He Arg Glu Gin Arg Leu Thr Gly Leu Val Val Phe He Leu 

755 760 765 

Thr Gly Ala Ser He Phe Leu Ala Pro Val Leu Lys Phe He Pro 

770 775 780 

Met Pro Val Leu Tyr Gly He Phe Leu Tyr Met Gly Val Ala Ala 
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785 



790 



795 



Leu Ser Ser lie Gin Phe Thr Asn Arg Val Lys Leu Leu Leu Met 

800 805 810 

Pro Ala Lys His Gin Pro Asp Leu Leu Leu Leu Arg His Val Pro 

815 820 825 

Leu Thr Arg Val His Leu Phe Thr Ala He Gin Leu Ala Cys Leu 

830 835 840 

Gly Leu Leu Trp He He Lys Ser Thr Pro Ala Ala He He Phe 

845 850 855 

Pro Leu Met Leu Leu Gly Leu Val Gly Val Arg Lys Ala Leu Glu 

860 865 870 

Arg Val Phe Ser Pro Gin Glu Leu Leu Trp Leu Asp Glu Leu Met 

875 880 885 

Pro Glu Glu Glu Arg Ser He Pro Glu Lys Gly Leu Glu Pro Glu 

890 895 900 

His Ser Phe Ser Gly Ser Asp Ser Glu Asp Ser Glu Leu Met Tyr 

905 910 915 

Gin Pro Lys Ala Pro Glu He Asn He Ser Val Asn 



<210> 12 

<211> 516 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7476943CD1 

<400> 12 

Met Pro Ser Gly Ser His Trp Thr Ala Asn Ser Ser Lys He He 
1 5 10 15 

Thr Trp Leu Leu Glu Gin Pro Gly Lys Glu Glu Lys Arg Lys Thr 
20 25 30 

Met Ala Lys Val Asn Arg Ala Arg Ser Thr Ser Pro Pro Asp Gly 
35 40 45 

Gly Trp Gly Trp Met He Val Ala Gly Cys Phe Leu Val Thr He 
50 55 60 

Cys Thr Arg Ala Val Thr Arg Cys He Ser He Phe Phe Val Glu 
65 70 75 

Phe Gin Thr Tyr Phe Thr Gin Asp Tyr Ala Gin Thr Ala Trp He 
80 85 90 

His Ser He Val Asp Cys Val Thr Met Leu Cys Ala Pro Leu Gly 
95 100 105 

Ser Val Val Ser Asn His Leu Ser Cys Gin Val Gly He Met Leu 
110 115 120 

Gly Gly Leu Leu Ala Ser Thr Gly Leu He Leu Ser Ser Phe Ala 
125 130 135 

Thr Ser Leu Lys His Leu Tyr Leu Thr Leu Gly Val Leu Thr Gly 
140 145 150 

Leu Gly Phe Al-a Leu Cys Tyr Ser Pro Ala He Ala Met Val Gly 
155 160 165 

Lys Tyr Phe Ser Arg Arg Lys Ala Leu Ala Tyr Gly He Ala Met 
170 175 180 

Ser Gly Ser Gly He Gly Thr Phe He Leu Ala Pro Val Val Gin 



920 



925 



185 



190 



195 
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Leu Leu lie Glu Gin Phe Ser Trp Arg Gly Ala Leu Leu lie Leu 

200 205 210 

Gly Gly Phe Val Leu Asn Leu Cys Val Cys Gly Ala Leu Met Arg 

215 220 225 

Pro lie Thr Leu Lys Glu Asp His Thr Thr Pro Glu Gin Asn His 

230 235 240 

Val Cys Arg Thr Gin Lys Glu Asp He Lys Arg Val Ser Pro Tyr 

245 250 255 

Ser Ser Leu Thr Lys Glu Trp Ala Gin Thr Cys Leu Cys Cys Cys 

260 265 270 

Leu Gin Gin Glu Tyr Ser Phe Leu Leu Met Ser Asp Phe Val Val 

275 280 285 

Leu Ala Val Ser Val Leu Phe Met Ala Tyr Gly Cys Ser Pro Leu 

290 295 300 

Phe Val Tyr Leu Val Pro Tyr Ala Leu Ser Val Gly Val Ser His 

305 310 315 

Gin Gin Ala Ala Phe Leu Met Ser He Leu Gly Val He Asp He 

320 325 330 

He Gly Asn He Thr Phe Gly Trp Leu Thr Asp Arg Arg Cys Leu 

335 340 345 

Lys Asn Tyr Gin Tyx Val Cys Tyr Leu Phe Ala Val Gly Met Asp 

350 355 360 

Gly Leu Cys Tyr Leu Cys Leu Pro Met Leu Gin Ser Leu Pro Leu 

365 370 375 

Leu Val Pro Phe Ser Cys Thr Phe Gly Tyr Phe Asp Gly Ala Tyr 

380 385 390 

Val Thr Leu He Pro Val Val Thr Thr Glu He Val Gly Thr Thr 

395 400 405 

Ser Leu Ser Ser Ala Leu Gly Val Val Tyr Phe Leu His Ala Val 

410 415 420 

Pro Tyr Leu Val Ser Pro Pro He Ala Gly Arg Leu Val Asp Thr 

425 430 435 

Thr Gly Ser Tyr Thr Ala Ala Phe Leu Leu Cys Gly Phe Ser Met 

440 445 450 

He Phe Ser Ser Val Leu Leu Gly Phe Ala Arg Leu He Lys Arg 

455 460 465 

Met Arg "Lys Thr Gin Leu Gin Phe He Ala Lys Glu Ser Asp Pro 

470 475 480 

Lys Leu Gin Leu Trp Thr Asn Gly Ser Val Ala Tyr Ser Val Ala 

485 490 495 

Arg Glu Leu Asp Gin Lys His Gly Glu Pro Val Ala Thr Ala Val 

500 505 510 

Pro Gly Tyr Ser Leu Thr 

515 



<210> 13 

<211> 514 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Xncyte ID No: 8003355CD1 

<400> 13 

Met His Gly Gly Gin Gly Pro Leu Leu Leu Leu Leu Leu Leu Ala 
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15 10 15 

Val Cys Leu Gly Ala Gin Gly Arg Asn Gin Glu Glu Arg Leu Leu 
20 25 30 

Ala Asp Leu Met Gin Asn Tyr Asp Pro Asn Leu Arg Pro Ala Glu 
35 40 45 

Arg Asp Ser Asp Val Val Asn Val Ser Leu Lys Leu Thr Leu Thr 
50 55 60 

Asn Leu lie Ser Leu Asn Glu Arg Glu Glu Ala Leu Thr Thr Asn 
65 70 75 

Val Trp lie Glu Val Gin Trp Cys Asp Tyr Arg Leu Arg Arg Asp 
80 85 90 

Pro Arg Asp Tyr Glu Gly Leu Trp Val Leu Arg Val Pro Ser Thr 
95 100 105 

Met Val Trp Arg Pro Asp lie Val Leu Glu Asn Asn Ala Asp Gly 

110 115 120 

Val Phe Glu Val Ala Leu Tyr Cys Asn Val Leu Val Ser Pro Asp 

125 130 135 

Gly Cys lie Tyr Trp Leu Pro Pro Ala He Phe Arg Ser Ala Cys 

140 145 150 

Ser He Ser Val Thr Tyr Phe Pro Phe Asp Trp Gin Asn Cys Ser 

155 160 165 

Leu He Phe Gin Ser Gin Thr Tyr Ser Thr Asn Glu He Asp Leu 

170 175 180 

Gin Leu Ser Gin Glu Asp Gly Gin Thr He Glu Trp He Phe He 

185 190 195 

Asp Pro Glu Ala Phe Thr Glu Asn Gly Glu Trp Ala He Gin His 

200 205 210 

Arg Pro Ala Lys Met Leu Leu Asp Pro Ala Ala Pro Ala Gin Glu 

215 220 225 

Ala Gly His Gin Lys Val Val Phe Tyr Leu Leu He Gin Arg Lys 

230 235 240 

Pro Leu Phe Tyr Val He Asn He He Ala Pro Cys Val Leu He 

245 250 255 

Ser Ser Val Ala He Leu He His Phe Leu Pro Ala Lys Ala Gly 

260 265 270 

Gly Gin Lys Cys Thr Val Ala He Asn Val Leu Leu Ala Gin Thr 

275 280 285 

Val Phe Leu Phe Leu Val Ala Lys Lys Val Pro Glu Thr Ser Gin 

290 295 300 

Ala Val Pro Leu He Ser Lys Tyr Leu Thr Phe Leu Leu Val Val 

305 . 310 315 

Thr He Leu He Val Val Asn Ala Val Val Val Leu Asn Val Ser 

320 325 330 

Leu Arg Ser Pro His Thr His Ser Met Ala Arg Gly Val Phe Leu 

335 340 - 345 

Arg Leu Leu Pro Gin Leu Leu Arg Met His Val Arg Pro Leu Ala 

350 355 360 

Pro Ala Ala Val Gin Asp Thr Gin Ser Arg Leu Gin Asn Gly Ser 

365 370 375 

Ser Gly Trp Ser He Thr Thr Gly Glu Glu Val Ala Leu Cys Leu 

380 t 385 390 

Pro Arg Ser Glu Leu Leu Phe Gin Gin Trp Gin Arg. Gin Gly Leu 

395 400 405 

Val Ala Ala Ala Leu Glu Lys Leu Glu Lys Gly Pro Glu Leu Gly 

410 415 420 

Leu Ser Gin Phe Cys Gly Ser Leu Lys Gin Ala Ala Pro Ala He 
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425 



430 



435 



Gin Ala Cys Val Glu Ala Cys Asn Leu lie Ala Cys Ala Arg His 

440 445 450 

Gin Gin Ser His Phe Asp Asn Gly Asn Glu Glu Trp Phe Leu Val 

455 460 465 

Gly Arg Val Leu Asp Arg Val Cys Phe Leu Ala Met Leu Ser Leu 

470 475 480 

Phe lie Cys Gly Thr Ala Gly lie Phe Leu Met Ala His Tyr Asn 

485 490 495 

Arg Val Pro Ala Leu Pro Phe Pro Gly Asp Pro Arg Pro Tyr Leu 

500 505 510 

Pro Ser Pro Asp 



<210> 14 

<211> 691 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3116448CD1 



<400> 14 

Met Glu Leu Arg Ser Thr Ala Ala Pro Arg Ala Glu Gly Tyr Ser 
1 5 -10 15 

Asn Val Gly Phe Gin Asn Glu Glu Asn Phe Leu Glu Asn Glu Asn 
20 25 30 

Thr Ser Gly Asn Asn Ser lie Arg Ser Arg Ala Val Gin Ser Arg 
35 40 45 

Glu His Thr Asn Thr Lys Gin Asp Glu Glu Gin Val Thr Val Glu 
50 55 60 

Gin Asp Ser Pro Arg Asn Arg Glu His Met Glu Asp Asp Asp Glu 
65 70 75 

Glu Met Gin Gin Lys Gly Cys Leu Glu Arg Arg Tyr Asp Thr Val 
80 85 90 

Cys Gly Phe Cys Arg Lys His Lys Thr Thr Leu Arg His He He 
95 100 105 

Trp Gly He lieu Leu Ala Gly Tyr Leu Val Met Val lie Ser Ala 
110 115 120 

Cys Val Leu Asn Phe His Arg Ala Leu Pro Leu Phe Val He Thr 
125 130 135 

Val Ala Ala He Phe Phe Val Val Trp Asp His Leu Met Ala Lys 
140 145 150 

Tyr Glu His Arg He Asp Glu Met Leu Ser Pro Gly Arg Arg Leu 
155 160 165 

Leu Asn Ser His Trp Phe Trp Leu Lys Trp Val He Trp Ser Ser 
170 175 180 

Leu Val Leu Ala Val lie Phe Trp Leu Ala Phe Asp Thr Ala Lys 
185 190 195 

Leu Gly Gin Gin Gin Leu Val Ser Phe Gly Gly Leu He Met Tyr 
200 205 210 

He Val Leu Leu Phe Leu Phe Ser Lys Tyr Pro Thr Arg Val Tyr 
215 220 ' 225 

Trp Arg Pro Val Leu Trp Gly He Gly Leu Gin Phe Leu Leu Gly 



- 230 



235 



240 
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Leu Leu lie Leu Arg Thr Asp Pro Gly Phe lie Ala Phe Asp Trp 

245 250 255 

Leu Gly Arg Gin Val Gin Thr Phe Leu Glu Tyr Thr Asp Ala Gly 

260 265 270 

Ala Ser Phe Gly Phe Gly Glu Lys Tyr Lys Asp His Phe Phe Gly 

275 280 285 

Phe Lys Val Leu Ala lie Val Val Phe Phe Ser Thr Val Met Ser 

290 295 300 

Met Leu Tyr Tyr Leu Gly Leu Met Gin Trp lie lie Arg Lys Val 

305 310 315 

Gly Trp lie Met Leu Val Thr Thr Gly Ser Ser Pro lie Glu Ser 

320 325 330 

Val Val Ala Ser Gly Asn lie Phe Val Gly Gin Thr Glu Ser Pro 

335 340 345 

Leu Leu Val Arg Pro Tyr Leu Pro Tyr lie Thr Lys Ser Glu Leu 

350 355 360 

His Ala He Met Thr Ala Gly Phe Ser Thr He Ala Gly Ser Val 

365 370 375 

Leu Gly Ala Tyr He Ser Phe Gly Val Pro Ser Ser His Leu Leu 

380 ^ 385 390 

Thr Ala Ser Val Met Ser Ala Pro Ala Ser Leu Ala Ala Ala Lys 

395 400 405 

Leu Phe Trp Pro Glu Thr Glu Lys Pro Lys He Thr Leu Lys Asn 

410 415 420 

Ala Met Lys Met Glu Ser Gly Asp Ser Gly Asn Leu Leu Glu Ala 

425 430 435 

Ala Thr Gin Gly Ala Ser Ser Ser He Ser Leu Val Ala Asn He 

440 445 450 

Ala Val Asn Leu He. Ala Phe Leu Ala Leu Leu Ser Phe Met Asn 

455 460 465 

Ser Ala Leu Ser Trp Phe Gly Asn Met Phe Asp Tyr- Pro Gin Leu 

470 475 480 

Ser Phe Glu Leu He Cys Ser Tyr He Phe Met Pro Phe Ser Phe 

485 490 495 

Met Met Gly Val Glu Trp Gin Asp Ser Phe Met Val Ala Arg Leu 

500 505 510 

He Gly Tyr Lys - Thr Phe Phe Asn Glu Phe Val Ala Tyr Glu His 

515 520 525 

Leu Ser Lys Trp He His Leu Arg Lys Glu Gly Gly Pro Lys Phe 

530 535 540 

Val Asn Gly Val Gin Gin Tyr He Ser He Arg Ser Glu He He 

545 550 555 

Ala Thr Tyr Ala Leu Cys Gly Phe Ala Asn He Gly Ser Leu Gly 

560 565 570 

He Val He Gly Gly Leu Thr Ser Met Ala Pro Ser Arg Lys Arg 

575 580 585 

Asp He Ala Ser Gly Ala Val Arg Ala Leu He Ala Gly Thr Val 

590 595 600 

Ala Cys Phe Met Thr Ala Cys He Ala Gly He Leu Ser Ser Thr 

605 610 615 

Pro Val Asp He Asn Cys His His Val Leu Glu Asn Ala Phe Asn 

620 625 630 

Ser Thr Phe Pro Gly Asn Thr Thr Lys Val He Ala Cys Cys Gin 

635 640 645 

Ser Leu Leu Ser Ser Thr Val Ala Lys Gly Pro Gly Glu Val He 

650 655 660 



28/71 



WO 02/22684 



PCT/US01/28938 



Pro Gly Gly Asn His Ser Leu Tyr Ser Leu Lys Gly Cys Cys Thr 
665 670 675 

Leu Leu Asn Pro Ser Thr Phe Asn Cys Asn Gly He Ser Asn Thr 
680 685 690 

Phe 



<210> 15 

<211> 342 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 622868CD1 

<400> 15 

Met Lys Ser Arg Thr Trp Ala Ser Val His Leu His Ser Phe Phe 
15 10 15 

Ala Val Gly Thr Leu Leu Val Ala Leu Thr Gly Tyr Leu Val Arg 
20 25 30 

Thr Trp Trp Leu Tyr Gin Met He Leu Ser Thr Val Thr Val Pro 
35 40 45 

Phe He Leu Cys Cys Trp Val Leu Pro Glu Thr Pro Phe Trp Leu 
50 55 60 

Leu Ser Glu Gly Arg Tyr Glu Glu Ala Gin Lys He Val Asp He 
65 70 75 

Met Ala Lys Trp Asn Arg Ala Ser Ser Cys Lys Leu Ser Glu Leu 
80 85 90 

Leu Ser Leu Asp Leu Gin Gly Pro Val Ser Asn Ser Pro Thr Glu 
95 100 105 

Val Gin Lys His Asn Leu Ser Tyr Leu Phe Tyr Asn Trp Ser He 

110 115 120 

Thr Lys Arg Thr Leu Thr Val Trp Leu lie Trp Phe Thr Gly Ser 

125 130 135 

Leu Gly Phe Tyr Ser Phe Ser Leu Asn Ser Val Asn Leu Gly Gly 

140 145 150 

Asn Glu Tyr Leu Asn Leu Phe Leu Leu Gly Val Val Glu He Pro 

155 160 165 

Ala Tyr Thr Phe Val Cys He Ala Thr Asp Lys Val Gly Arg Arg 

170 175 180 

Thr Val Leu Ala Tyr Ser Leu Phe Cys Ser Ala Leu Ala Cys Gly 

185 190 195 

Val Val Met Val He Pro Gin Lys His Tyr He Leu Gly Val Val 

200 205 210 

Thr Ala Met Val Gly Lys Phe Ala He Gly Ala Ala Phe Gly Leu 

215 220 225 

He Tyr Leu Tyr Thr Ala Glu Leu Tyr Pro Thr He Val Arg Ser 

230 235 240 

Leu Ala Val Gly Ser Gly Ser Met Val Cys Arg Leu Ala Ser He 

245 250 255 

Leu Ala Pro Phe Ser Val Asp Leu Ser Ser He Trp He Phe He 

260 265 270 

Pro Gin Leu Phe Val Gly Thr Met Ala Leu Leu Ser Gly Val Leu 

275 280 285 

Thr Leu Lys Leu Pro Glu Thr Leu Gly Lys Arg Leu Ala Thr Thr 
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290 295 300 

Trp Glu Glu Ala Ala Lys Leu Glu Ser Glu Asn Glu Ser Lys Ser 

305 310 315 

Ser Lys Leu Leu Leu Thr Thr Asn Asn Ser Gly Leu Glu Lys Thr 

320 325 330 

Glu Ala lie Thr Pro Arg Asp Ser Gly Leu Gly Glu 

335 340 



<210> 16 
<211> 791 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7476494CD1 

<400> 16 



Met 


Gly 


His 


Phe 


Glu 


Lys 


Gly 


Gin 


His Ala 


Leu Leu Asn Glu Gly 


1 








5 








10 




15 


Glu 


Glu 


Asn 


Glu 


Met 


Glu 


He 


Phe Gly Tyr 


Arg Thr Gin Gly Cys 










20 








25 




30 


Arg 


Lys 


Ser 


Leu 


Cys 


Leu 


Ala 


Gly Ser He 


Phe Ser Phe 


Gly He 










35 








40 




45 


Leu 


Pro 


Leu 


Val 


Phe 


Tyr 


Trp 


Arg 


Pro Ala 


Trp His Val Trp Ala 










50 








55 




60 


His 


Cys 


Val 


Pro 


Cys 


Ser 


Leu 


Gin 


Glu Ala 


Asp Thr Val 


Leu Leu 










65 








70 




75 


Arg 


Thr 


Thr 


Val 


Arg 


Cys 


He 


Lys 


Val Gin 


Lys He Arg 


Tyr Val 










80 








85 




90 


Trp 


Asn 


Tyr 


Leu 


Glu 


Gly 


Gin 


Phe Gin Lys 


He Gly Ser 


Leu Glu 










95 








100 




105 


Asp 


Trp 


Leu 


Ser 


Ser 


Ala 


Lys 


He 


His Gin 


Lys Phe Gly Ser Gly 










110 








115 




120 


Leu 


Thr 


Arg 


Glu 


Glu 


Gin 


Glu 


He 


Arg Arg 


Leu Met Cys 


Gly Pro 










125 








130 




135 


Asn 


Thr 


He 


Asp 


Val 


Glu 


Val 


Thr 


Pro He 


Trp Lys Leu 


Leu He 










140 








145 




150 


Lys 


Glu 


Val 


Leu 


Asn 


Pro 


Phe 


Tyr 


He Phe 


Gin Leu Phe 


Ser Val 










155 








160 




165 


Cys 


Leu 


Trp 


Phe 


Ser 


Glu 


Asp 


Tyr 


Lys Glu 


Tyr Ala Phe 


Ala He 










170 








175 




180 


He 


He 


Met 


Ser 


He 


He 


Ser 


He 


Ser Leu 


Thr Val Tyr 


Asp Leu 










185 








190 




195 


Arg 


Glu 


Gin 


Ser 


Val 


Lys 


Leu 


His 


His Leu 


Val Glu Ser 


His Asn 










200 








205 




210 


Ser 


He 


Thr 


Val 


Ser 


Val 


Cys 


Gly Arg Lys 


Ala Gly Val 


Gin Glu 










215 








220 




225 


Leu 


Glu 


Ser 


Arg 


Val 


Leu 


Val 


Pro Gly Asp 


Leu Leu He 


Leu Thr 










230 








235 




240 


Gly 


Asn 


Lys 


Val 


Leu 


Met 


Pro 


Cys 


Asp Ala 


Val Leu He Glu Gly 










245 








250 




255 


Ser 


Cys 


Val 


Val 


Asp 


Glu 


Gly 


Met 


Leu Thr 


Gly Glu Ser 


He Pro 










260 








265 




270 


Val 


Thr 


Lys 


Thr 


Pro 


Leu 


Pro 


Lys 


Met Asp 


Ser Ser Val 


Pro Trp 










275 








280 




285 
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Lys Thr Gin 
Gly Thr Glu 
Arg Ala Val 
Leu Val Arg 
Tyr Arg Asp 
Thr lie Gly 
Glu Pro Pro 
lie Ala Val 
lie Tyr Ala 
Ser Pro Gin 
Phe Asp Lys 
Gly Val Val 
Phe Ala Ser 
Met Ala Ser 
Gly Asp Pro 
Met Ala Phe 
His Ala Met 
Val Glu Gly 
Leu Gin Arg 
Leu Ala Phe 
Gin Pro Glu 
Tyr Thr Thr 
Leu Glu Asn 
Glu Ser Asp 
Leu Lys Glu 
Arg lie Arg 
lie Thr Val 
Val lie Leu 



Ser Glu Ala 

290 
Val lie Gin 

305 
Val Leu Gin 

320 
Ser lie Leu 

335 
Ala lie Arg 

350 
Met lie Tyr 

365 
Glu Glu Val 

380 
Pro Pro Ala 

395 
Gin Arg Arg 

410 
Arg lie Asn 

425 
Thr Gly Thr 

440 
Ser Cys Asp 

455 
Gly Gin Ala 

470 
Cys His Ser 

485 
Leu Asp Leu 

500 
Ser Gly Asp 

515 
Val Val Lys 

530 
He Ala He 

545 
Met Thr Val 

560 

Met Lys Gly 

575 
Thr Val Pro 

590 
Gin Gly Phe 

605 
Asp His His 

620 
Leu He Phe 

635 
Glu Thr Lys 

650 

Thr Val Met 

665 
Ala Arg Lys 

680 
He Glu Ala 

695 



Asp Tyr Lys 
Ala Lys Ala 
Thr Gly Phe 
Tyr Pro Lys 
Phe Leu Leu 
Thr Leu Cys 
Val Arg Lys 
Leu Pro Ala 
Leu Lys Lys 
Val Cys Gly 
Leu Thr Arg 
Arg Asn Gly 
Leu Pro Trp 
Leu He Leu 
Lys Met Phe 
Asp Phe His 
Pro Cys Arg 
Leu His Gin 
He Val Gin 
Ala Pro Glu 
Thr Ser Phe 
Arg Val He 
Ala Thr Thr 
Leu Gly Leu 
Pro Val Leu 
lie Thr Gly 
Ser Gly Met 
Asn Glu Thr 



Arg His Val 
295 

Ala Cys Ser 
310 

Asn Thr Ala 
325 

Pro Val Asn 
340 

Cys Leu Val 
355 

Val Tyr Val 
370 

Ala Leu Asp 
385 

Ala Leu Thr 
400 

Arg Gly He 
415 

Gin Leu Asn 
430 

Asp Gly Leu 
445 

Phe Gin Glu 
460 

Gly Pro Leu 
475 

Leu Asp Gly 
490 

Glu Ala Thr 
505 

He Lys Gly 
520 

Thr Ala Ser 
535 

Phe Pro Phe 
550 

Glu Met Gly 
565 

Arg Val Ala 
580 

Val Ser Glu 
595 

Ala Leu Ala 
610 

Leu Thr Arg 
625 

Leu He Leu 
640 

Glu Glu Leu 
655 

Asp Asn Leu 
670 

Val Ser Glu 
685 

Thr Gly Ser 
700 



Leu Phe Cys 
300 

Gly Thr Val 
315 

Lys Gly Asp 
330 

Phe Gin Leu 
345 

Gly Thr Ala 
360 

Leu Ser Gly 
375 

Val He Thr 
.390 

Thr Gly He 
405 

Phe Cys lie 
420 

Leu Val Cys 
435 

Asp Leu Trp 
450 

Val His Ser 
465 

Cys Ala Ala 
480 

Thr He Gin 
495 

Thr Trp Glu 
510 

Val Pro Ala 
525 

Gin Val Pro 
540 

Ser Ser Ala 
555 

Gly Asp Arg 
570 

Ser Phe Cys 
585 

Leu Gin He 
600 

Tyr Lys Lys 
615 

Glu Thr Val 
630 

Glu Asn Arg 
645 

He Ser Ala 
660 

Gin Thr Ala 
675 

Ser Gin Lys 
690 

Ser Ser Ala 
705 
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Ser lie Ser Trp Thr Leu Val Glu Glu Lys Lys His He Met Tyr 

710 715 720 

Gly Asn Gin Asp Asn Tyr He Asn He Arg Asp Glu Val Ser Asp 

725 730 735 

Lys Gly Arg Glu Gly Ser Tyr His Phe Ala Leu Thr Gly Lys Ser 

740 745 750 

Phe His Val He Ser Gin His Phe Ser Ser Leu Leu Pro Lys He 

755 760 765 

Leu He Asn Gly Thr He Phe Ala Arg Met Ser Pro Gly Gin Lys 

770 775 780 

Ser Ser Leu Val Glu Glu Phe Gin Lys Leu Glu 

785 790 



<210> 17 
<211> 1108 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7477260CD1 



<400> 17 

Met Val Thr Gly Gly Gin His His Pro Gly Ala Gly Leu Ser Phe 
1 5 10 15 

Thr Glu Leu Glu Asn Thr Phe Pro Leu Cys Leu Pro Pro Thr Pro 
20 25 30 

Phe Leu Leu Ala Leu Trp Ser Ser Cys Leu Pro Trp Asp Thr Gin 
35 40 45 

Gin Thr Cys Cys Pro Ser Phe Ala Gly Ser Pro Ala Ala Glu Gin 
50 55 60 

Leu Gin Asp He Leu Gly Glu Glu Asp Glu Ala Pro Asn Pro Thr 
65 70 75 

Leu Phe Thr Glu Met Asp Thr Leu Gin His Asp Gly Asp Gin Met 
80 85 90 

Glu Trp Lys Glu Ser Ala Arg Trp He Lys Phe Glu Glu Lys Val 
95 100 105 

Glu Glu Gly Gly Glu Arg Trp Ser Lys Pro His Val Ser Thr Leu 
110 115 120 

Ser Leu His Ser Leu Phe Glu Leu Arg Thr Cys Leu Gin Thr Gly 
125 130 135 

Thr Val Leu Leu Asp Leu Asp Ser Gly Ser Leu Pro Gin He He 
140 145 150 

Asp Asp Val He Glu Lys Gin He Glu Asp Gly Leu Leu Arg Pro 
155 160 165 

Glu Leu Arg Glu Arg Val Ser Tyr Val Leu Leu Arg Arg His Arg 
170 175 180 

His Gin Thr Lys Lys Pro He His Arg Ser Leu Ala Asp He Gly 
185 190 195 

Lys Ser Val Ser Thr Thr Asn Arg Ser Pro Ala Arg Ser Pro Gly 
200 205 210 

Ala Gly Pro Ser Leu His His Ser Thr Glu Asp Leu Arg Met Arg 
215 220 225 

Gin Ser Ala Asn Tyr Gly Arg Leu Cys His Ala Gin Ser Arg Ser 
230 235 240 

Met Asn Asp He Ser Leu Thr Pro Asn Thr Asp Gin Arg Lys Asn 
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245 250 255 

Lys Phe Met Lys Lys lie Pro Lys Asp Ser Glu Ala Ser Asn Val 

260 265 270 

Leu Val Gly Glu Val Asp Phe Leu Asp Gin Pro Phe lie Ala Phe 

275 280 285 

Val Arg Leu lie Gin Ser Ala Met Leu Gly Gly Val Thr Glu Val 

290 295 300 

Pro Val Pro Thr Arg Phe Leu Phe lie Leu Leu Gly Pro Ser Gly 

305 310 315 

Arg Ala Lys Ser Tyr Asn Glu lie Gly Arg Ala He Ala Thr Leu 

320 325 330 

Met Val Asp Asp Leu Phe Ser Asp Val Ala Tyr Lys Ala Arg Asn 

335 340 345 

Arg Glu Asp Leu He Ala Gly He Asp Glu Phe Leu Asp Glu Val 

350 355 360 

He Val Leu Pro Pro Gly Glu Trp Asp Pro Asn He Arg He Glu 

365 370 375 

Pro Pro Lys Lys Val Pro Ser Ala Asp Lys Arg Lys Ser Leu Phe 

380 385 390 

Ser Leu Ala Glu Leu Gly Gin Met Asn Gly Ser Val Gly Gly Gly 

395 400 405 

Gly Gly Ala Pro Gly Gly Gly Asn Gly Gly Gly Gly Gly Gly Gly 

410 415 420 

Ser Gly Gly Gly Ala Gly Ser Gly Gly Ala Gly Gly Thr Ser Ser 

425 430 435 

Gly Asp Asp Gly Glu Met Pro Ala Met His Glu He Gly Glu Glu 

440 445 450 

Leu He Trp Thr Gly Arg Phe Phe Gly Gly Leu Cys Leu Asp He 

455 460 465 

Lys Arg Lys Leu Pro Trp Phe Pro Ser Asp Phe Tyr Asp Gly Phe 

470 475 480 

His He Gin Ser He Ser Ala He Leu Phe He Tyr Leu Gly Cys 

485 490 495 

He Thr Asn Ala He Thr Phe Gly Gly Leu Leu Gly Asp Ala Thr 

500 505 510 

Asp Asn Tyr Gin Gly Val Met Glu Ser Phe Leu Gly Thr Ala Met 
- 515 520 525 

Ala Gly Ser Leu Phe Cys Leu Phe Ser Gly Gin Pro Leu He He 

530 535 540 

Leu Ser Ser Thr Gly Pro He Leu He Phe Glu Lys Leu Leu Phe 

545 550 555 

Asp Phe Ser Lys Gly Asn Gly Leu Asp Tyr Met Glu Phe Arg Leu 

560 565 570 

Trp He Gly Leu His Ser Ala Val Gin Cys Leu lie Leu Val Ala 

575 580 585 

Thr Asp Ala Ser Phe He He Lys Tyr He Thr Arg Phe Thr Glu 

590 595 600 

Glu Gly Phe Ser Thr Leu He Ser Phe He Phe He Tyr Asp Ala 

605 610 615 

He Lys Lys Met He Gly Ala Phe Lys Tyr Tyr p *"o He Asn Met 

620 625 630 

Asp Phe Lys Pro Asn Phe He Thr Thr Tyr Lys Cys Glu Cys Val 

635 640 645 

Ala Pro Asp Thr Gly Asp Leu Asn Thr Thr Val Phe Asn Ala Ser 

650 655 660 

Ala Pro Leu Ala Pro Asp Thr Asn Ala Ser Leu Tyr Asn Leu Leu 
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665 670 675 

Asn- Leu Thr Ala Leu Asp Trp Ser Leu Leu Ser Lys Lys Glu Cys 

680 685 690 

Leu Ser Tyr Gly Gly Arg Leu Leu Gly Asn Ser Cys Lys Phe He 

695 700 705 

Pro Asp Leu Ala Leu Met Ser Phe He Leu Phe Phe Gly Thr Tyr 

710 715 720 

Ser Met Thr Leu Thr Leu Lys Lys Phe Lys Phe Ser Arg Tyr Phe 

725 730 735 

Pro Thr Lys Val Arg Ala Leu Val Ala Asp Phe Ser He Val Phe 

740 745 . 750 

Ser He Leu Met Phe Cys Gly He Asp Ala Cys Phe Gly Leu Glu 

755 760 765 

Thr Pro Lys Leu His Val Pro Ser Val He Lys Pro Thr Arg Pro 

770 775 780 

Asp Arg Gly Trp Phe Val Ala Pro Phe Gly Lys Asn Pro Trp Trp 

785 790 795 

Val Tyr Pro Ala Ser He Leu Pro Ala Leu Leu Val Thr He Leu 

800 805 810 

He Phe Met Asp Gin Gin He Thr Ala Val He Val Asn Arg Lys 

815 820 825 

Glu Asn Lys Leu Lys Lys Ala Ala Gly Tyr His Leu Asp Leu Phe 

830 835 840 

Trp Val Gly He Leu Met Ala Leu Cys Ser Phe Met Gly Leu Pro 

845 850 855 

Trp Tyr Val Ala Ala Thr Val He Ser He Ala His He Asp Ser 

860 865 870 

Leu Lys Met Glu Thr Glu Thr Ser Ala Pro Gly Glu Gin Pro Gin 

875 880 885 

Phe Leu Gly Val Arg Glu Gin Arg Val Thr Gly He He Val Phe 

890 895 900 

He Leu Thr Gly He Ser Val Phe Leu Ala Pro He Leu Lys Cys 

905 910 915 

He Pro Leu Pro Val Leu Tyr Gly Val Phe Leu Tyr Met Gly Val 

920 925 930 

Ala Ser Leu Asn Gly He Gin Phe Trp Glu Arg Cys Lys Leu Phe 

935 940 945 

Leu Met Pro Ala Lys His Gin Pro Asp His Ala Phe Leu Arg His 

950 955 960 

Val Pro Leu Arg Arg He His Leu Phe Thr Leu Val Gin He Leu 

965 , 970 975 

Cys Leu Ala Val Leu Trp He Leu Lys Ser Thr Val Ala Ala He 

980 985 990 

He Phe Pro Val Met He Leu Gly Leu He He Val Arg Arg Leu 

995 1000 1005 

Leu Asp Phe He Phe Ser Gin His Asp Leu Ala Trp He Asp Asn 
1010 1015 1020 

He Leu Pro Glu Lys Glu Lys Lys Glu Thr Asp Lys Lys Arg Lys 
1025 1030 1035 

Arg Lys Lys Gly Ala His Glu Asp Cys Asp Glu Glu Glu Lys Asp 
1040 1045 1050 

Leu Pro Val Gly Val Thr His Ser Asp Ser Ser Phe Ser Asp Thr 
1055 1060 1065 

Glu Leu Asp Arg Ser Tyr Ser Arg Asn Pro Val Phe Met Val Pro 
1070 1075 1080 

Gin Val Lys He Glu Met Glu Ser Asp Tyr Asp Phe Thr Asp Met 
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1085 

Asp Lys Tyr Arg Arg 
1100 



1090 

Glu Thr Asp Ser Glu 
1105 



Thr Thr Leu 



1095 



<210> 18 

<211> 480 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1963058CD1 

<400> 18 

Met Gly Pro Gly Pro Pro Ala Ala Gly Ala Ala Pro Ser Pro Arg 
15 10 15 

Pro Leu Ser Leu Val Ala Arg Leu Ser Tyr t Ala Val Gly His Phe 
20 25* 30 

Leu Asn Asp Leu Cys Ala Ser Met Trp Phe Thr Tyr Leu Leu Leu 
35 40 45 

Tyr Leu His Ser Val Arg Ala Tyr Ser Ser Arg Gly Ala Gly Leu 
50 55 60 

Leu Leu Leu Leu Gly Gin Val Ala Asp Gly Leu Cys Thr Pro Leu 
65 70 75 

Val Gly Tyr Glu Ala Asp Arg Ala Ala Ser Cys Cys Ala Arg Tyr 
80 85 90 

Gly Pro Arg Lys Ala Trp His Leu Val Gly Thr Val Cys Val Leu 
95 100 105 

Leu Ser Phe Pro Phe He Phe Ser Pro Cys Leu Gly Cys Gly Ala 

110 115 120 

Ala Thr Pro Glu Trp Ala Ala Leu Leu Tyr Tyr Gly Pro Phe He 

125 130 135 

Val He Phe Gin Phe Gly Trp Ala Ser Thr Gin lie Ser His Leu 

140 145 150 

Ser Leu He Pro Glu Leu Val Thr Asn Asp His Glu Lys Val Glu 

155 160 165 

Leu Thr Ala Leu Arg Tyr Ala Phe Thr Val Val Ala Asn He Thr 

170 175 180 

Val Tyr Gly Ala Ala Trp Leu Leu Leu His Leu Gin Gly Ser Ser 

185 190 195 

Arg Val Glu Pro Thr Gin Asp He Ser He Ser Asp Gin Leu Gly 

200 205 210 

Gly Gin Asp Val Pro Val Phe Arg Asn Leu Ser Leu Leu Val Val 

215 220 225 

Gly Val Gly Ala Val Phe Ser Leu Leu Phe His Leu Gly Thr Arg 

230 235 240 

Glu Arg Arg Arg Pro His Ala Glu Glu Pro Gly Glu His Thr Pro 

245 250 255 

Leu Leu Ala Pro Ala Thr Ala Gin Pro Leu Leu Leu Trp Lys His 

260 265 270 

Trp Leu Arg Glu Pro Ala Phe Tyr Gin Val Gly He Leu Tyr Met 

275 280 285 

Thr Thr Arg Leu He Val Asn Leu Ser Gin Thr Tyr Met Ala Met 

290 295 300 

Tyr Leu Thr Tyr Ser Leu His Leu Pro Lys Lys Phe He Ala Thr 



305 



310 



.315 
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lie Pro Leu Val Met Tyr Leu Ser Gly Phe Leu Ser Ser Phe Leu 

320 325 330 

Met Lys Pro lie Asn Lys Cys lie Gly Arg Asn Met Thr Tyr Phe 

335 340 345 

Ser Gly Leu Leu Val lie Leu Ala Phe Ala Ala Trp Val Ala Leu 

350 355 360 

Ala Glu Gly Leu Gly Val Ala Val Tyr Ala Ala Ala Val Leu Leu 

365 370 375 

Gly Ala Gly Cys Ala Thr He Leu Val Thx Ser Leu Ala Met Thr 

380 385 390 

Ala Asp Leu He Gly Pro His Thr Asn Ser Gly Ala Phe Val Tyr 

395 400 405 

Gly Ser Met Ser Phe Leu Asp Lys Val Ala Asn Gly Leu Ala Val 

410 415 420 

Met Ala He Gin Ser Leu His Pro Cys Pro Ser Glu Leu Cys Cys 

425 430 435 

Arg Ala Cys Val Ser Phe Tyr His Trp Ala Met Val Ala Val Thr 

440 445 450 

Gly Gly Val Gly Val Ala Ala Ala Leu Cys Leu Cys Ser Leu Leu 

455 460 465 

Leu Trp Pro Thr Arg Leu Arg Arg Trp Asp Arg Asp Ala Arg Pro 

470 475 480 



<210> 19 

<211> 381 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 2395967CD1 

<400> 19 

Met Ser Glu Phe Trp Leu He Ser Ala Pro Gly Asp Lys Glu Asn 
15 10 15 

Leu Gin Ala Leu Glu Arg Met Asn Thr Val Thr Ser Lys Ser Asn 
20 25 30 

Leu Ser Tyr Asn Thr Lys Phe Ala He Pro Asp Phe Lys Val Gly 
35 40 45 

Thr Leu Asp Ser Leu Val Gly Leu Ser Asp Glu Leu Gly Lys Leu 
50 55 60 

Asp Thr Phe Ala Glu Ser Leu He Arg Arg Met Ala Gin Ser Val 
65 70 75 

Val Glu Val Met Glu Asp Ser Lys Gly Lys Val Gin Glu His Leu 
80 85 90 

Leu Ala Asn Gly Val Asp Leu Thr Ser Phe Val Thr His Phe Glu 
'95 100 105 

Trp Asp Met Ala Lys Tyr Pro Val Lys Gin Pro Leu Val Ser Val 
110 115 120 

Val Asp Thr He Ala Lys Gin Leu Ala Gin He Glu Met Asp Leu 
125 130 135 

Lys Ser Arg Thr Ala Ala Tyr Asn Thr Leu Lys Thr Asn Leu Glu 
140 145 150 

Asn Leu Glu Lys Lys Ser Met Gly Asn Leu Phe Thr Arg Thr Leu 
155 160 165 
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Ser Asp He Val Ser Lys Glu Asp Phe Val Leu Asp Ser Glu Tyr 

170 175 180 

Leu Val Thr Leu Leu Val He Val Pro Lys Pro Asn Tyr Ser Gin 

185 190 195 

Trp Gin Lys Thr Tyr Glu Ser Leu Ser Asp Met Val Val Pro Arg 

200 205 210 

Ser Thr Lys Leu He Thr Glu Asp Lys Glu Gly Gly Leu Phe Thr 

215 220 225 

Val Thr Leu Phe Arg Lys Val He Glu Asp Phe Lys Thr Lys Ala 

230 235 240 

Lys Glu Asn Lys Phe Thr Val Arg Glu Phe Tyr Tyr Asp Glu Lys 

245 250 255 

Glu He Glu Arg Glu Arg Glu Glu Met Ala Arg Leu Leu Ser Asp 

260 265 270 

Lys Lys Gin Gin Tyr Gly Pro Leu Leu Arg Trp Leu Lys Val Asn 

275 280 285 

Phe Ser Glu Ala Phe He Ala Trp He His He Lys Ala Leu Arg 

290 295 300 

Val Phe Val Glu Ser Val Leu Arg Tyr Gly Leu Pro Val Asn Phe 

305 310 315 

Gin Ala Val Leu Leu Gin Pro His Lys Lys Ser Ser Thr Lys Arg 

320 325 330 

Leu Arg Glu Val Leu Asn Ser Val Phe Arg His Leu Asp Glu Val 

335 340 345 

Ala Ala Thr Ser He Leu Asp Ala Ser Val Glu He Pro Gly Leu 

350 355 360 

Gin Leu Asn Asn Gin Asp Tyr Phe Pro Tyr Val Tyr Phe His He 

365 370 375 

Asp Leu Ser Leu Leu Asp 

380 

<210> 20 

<211> 484 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3586648CD1 

<400> 20 

Met Tyr Thr Ser His Glu Asp He Gly Tyr Asp Phe Glu Asp Gly 
15 10 15 

Pro Lys Asp Lys Lys Thr Leu Lys Pro His Pro Asn He Asp Gly 

20 25 30 

Gly Trp Ala Trp Met Met Val Leu Ser Ser Phe Phe Val His He 

35 40 45 

Leu He Met Gly Ser Gin Met Ala Leu Gly Val Leu Asn Val Glu 

50 55 60 

Trp Leu Glu Glu Phe His Gin Ser Arg Gly Leu Thr Ala Trp Val 

65 70 75 

Ser Ser Leu Ser Met Gly He Thr Leu He Val Gly Pro Phe He 

80 85 90 

Gly Leu Phe He Asn Thr Cys Gly Cys Arg Gin Thr Ala He He 

95 100 105 

Gly Gly Leu Val Asn Ser Leu Gly Trp Val Leu Ser Ala Tyr Ala 
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110 115 120 

Ala Asn Val His Tyr Leu Phe lie Thr Phe Gly Val Ala Ala Gly 

125 130 135 

Leu Gly Ser Gly Met Ala Tyr Leu Pro Ala Val Val Met Val Gly 

140 145 150 

Arg Tyr Phe Gin Lys Arg Arg Ala Leu Ala Gin Gly Leu Ser Thr 

155 160 165 

Thr Gly Thr Gly Phe Gly Thr Phe Leu Met Thr Val Leu Leu Lys 

170 175 180 

Tyr Leu Cys Ala Glu Tyr Gly Trp Arg Asn Ala Met Leu lie Gin 

185 190 195 

Gly Ala Val Ser Leu Asn Leu Cys Val Cys Gly Ala Leu Met Arg 

200 205 210 

Pro Leu Ser Pro Gly Lys Asn Pro Asn Asp Pro Gly Glu Lys Asp 

215 220 225 

Val Arg Gly Leu Pro Ala His Ser Thr Glu Ser Val Lys Ser Thr 

230 235 240 

Gly Gin Gin Gly Arg Thr Glu Glu Lys Asp Gly Gly Leu Gly Asn 

245 250 255 

Glu Glu Thr Leu Cys Asp Leu Gin Ala Gin Glu Cys Pro Asp Gin 

260 265 270 

Ala Gly His Arg Lys Asn Met Cys Ala Leu Arg lie Leu Lys Thr 

275 280 285 

Val Ser Trp Leu Thr Met Arg Val Arg Lys Gly Phe Glu Asp Trp 

290 295 300 

Tyr Ser Gly Tyr Phe Gly Thr Ala Ser Leu Phe Thr Asn Arg Met 

305 310 315 

Phe Val Ala Phe He Phe Trp Ala Leu Phe Ala Tyr Ser Ser Phe 

320 325 330 

Val He Pro Phe He His Leu Pro Glu lie Val Asn Leu Tyr Asn 

335 340 345 

Leu Ser Glu Gin Asn Asp Val Phe Pro Leu Thr Ser He He Ala 

350 355 360 

He Val His He Phe Gly Lys Val He Leu Gly Val He Ala Asp 

365 * 370 375 

Leu Pro Cys He Ser Val Trp Asn Val Phe Leu Leu Ala Asn Phe 

380 385 390 

Thr Leu Val Leu Ser He Phe He Leu Pro Leu Met His Thr Tyr 

395 400 405 

Ala Gly Leu Ala Val He Cys Ala Leu He Gly Phe Ser Ser Gly 

410 415 420 

Tyr Phe Ser Leu Met Pro Val Val Thr Glu Asp Leu Val Gly He 

425 430 435 

Glu His Leu Ala Asn Ala Tyr Gly He He He Cys Ala Asn Gly 

440 445 450 

He Ser Ala Leu Leu Gly Pro Pro Phe Ala Gly Lys Leu Ser Glu 

455 460 465 

Val Leu Arg Ala Gin Ser Ala Cys Thr Tyr Gly Ala Leu Cys Tyr 

470 475 480 

Lys Val Pro Asp 



<210> 21 

<211> 736 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: • 7473396CD1 



<400> 21 

Met Gin Asn lie Thr Lys Glu Phe Gly Thr Phe Lys Ala Asn Asp 
15 10 15 

Asn He Asn Leu Gin Val Lys Ala Gly Glu He His Ala Leu Leu 
20 25 30 

Gly Glu Asn Gly Ala Gly Lys Ser Thr Leu Met Asn Val Leu Ser 
35 40 45 

Gly Leu Leu Glu Pro Thr Ser Gly Lys He Leu Met Arg Gly Lys 
50 55 60 

Glu Val Gin He Thr Ser Pro Thr Lys Ala Asn Gin Leu Gly He 
65 70 75 

Gly Met Val His Gin His Phe Met Leu Val Asp Ala Phe Thr Val 
80 85 90 

Thr Glu Asn He Val Leu Gly Ser Glu Pro Ser Arg Ala Gly Met 
95 100 105 

Leu Asp His Lys Lys Ala Arg Lys Glu He Gin Lys Val Ser Glu 

110 115 120 

Gin Tyr Gly Leu Ser Val Asn Pro Asp Ala Tyr Val Arg Asp He 

125 130 135 

Ser Val Gly Met Glu Gin Arg Val Glu He Leu Lys Thr Leu Tyr 

140 145 150 

Arg Gly Ala Asp Val Leu He Phe Asp Glu Pro Thr Ala Val Leu 

155 160 165 

Thr Pro Gin Glu He Asp Glu Leu lie Val He Met Lys Glu Leu 

170 175 180 

Val Lys Glu Gly Lys Ser He He Leu He Thr His Lys Leu Asp 

185 190 195 

Glu He Lys Ala Val Ala Asp Arg Cys Thr Val He Arg Arg Gly 

200 205 210 

Lys Gly He Gly Thr Val Asn Val Lys Asp Val Thr Ser Gin Gin 

215 220 225 

Leu Ala Asp Met Met Val Gly Arg Ala Val Ser Phe Lys Thr Met 

230 235 240 

Lys Lys Glu Ala Lys Pro Gin Glu Val Val Leu Ser He Glu Asn 

245 250 255 

Leu Val Val Lys Glu Asn Arg Gly Leu Glu Ala Val Lys Asn Leu 

260 265 270 

Asn Leu Glu Val Arg Ala Gly Glu Val Leu Gly He Ala Gly He 

275 280 285 

Asp Gly Asn Gly Gin Ser Glu Leu He Gin Ala Leu Thr Gly Leu 

290 295 300 

Arg Lys Ala Glu Ser Gly His He Lys Leu Lys Gly Glu Asp He 

305 310 315 

Thr Asn Lys Lys Pro Arg Lys He Thr Glu His Gly Val Gly His 

320 325 330 

Val Pro Glu Asp Arg His Lys Tyr Gly Leu Val Leu Asp Met Thr 

335 340 345 

Leu Ser Glu Asn He Ala Leu Gin Thr Tyr His Gin Lys Pro Tyr 

350 355 360 

Ser Lys Asn Gly Met Leu Asn Tyr Ser. Val He Asn Glu His Ala 

365 370 375 

Arg Glu Leu He Glu Glu Tyr Asp Val Arg Thr Thr Asn Glu Leu 
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380 






385 








390 


Val Pro Ala Lys Ala Leu Ser 


Gly Gly 


Asn 


Gin Gin Lys 


Ala 


He 


395 






400 








405 


He Ala Arg He Val Asp Arg 


Asp 


Pro 


Asp 


Leu 


Leu He 


Val 


Ala 


410 






415 








420 


Asn Pro Thr Arg Gly Leu Asp 


Val 


Gly 


Glu 


Phe 


Val Ala 


Val 


Thr 


425 






430 








435 


Gly Val Ser Gly Ser Gly Lys 


Ser 


Thr 


Leu 


Val 


Asn Ser 


He 


Leu 


440 






445 








450 


Lys Lys Ser Leu Ala Gin Lys 


Leu 


Asn 


Lys 


Asn 


Ser Ala 


Lys 


Pro 


455 






460 








465 


Gly Lys Phe Lys Thr He Ser 


Gly Tyr 


Glu 


Ser 


He Glu 


Lys 


He 


470 






475 








480 


He Asp He Asp Gin Ser Pro 


He Gly 


Arg 


Thr 


Pro Arg 


Ser 


Asn 


485 






490 








495 


Pro Ala Thr Tyr Thr Ser Val 


Phe Asp 


Asp 


He Arg Gly 


Leu 


Phe 


500 






505 








510 


Ala Gin Thr Asn Glu Ala Lys 


Met Arg 


Gly 


Tyr 


Lys Lys 


Gly Arg 


515 






520 








525 


Phe Ser Phe Asn Val Lys Gly 


Gly Arg 


Cys 


Glu Ala Cys 


Arg Gly 


530 






535 








540 


Asp Gly He He Lys He Glu 


Met 


His 


Phe 


Leu 


Pro Asp 


Val 


Tyr 


545 






550 








555 


Val Pro Cys Glu Val Cys His 


Gly Lys 


Arg 


Tyr 


Asn Ser 


Glu 


Thr 


560 






565 








570 


Leu Glu Val His Tyr Lys Gly 


Lys 


Ser 


lie 


Ala 


Asp He 


Leu 


Glu 


575 






580 








585 


Met Thr Val Glu Asp Ala Val 


Glu 


Phe 


Phe 


Lys 


His He 


Pro 


Lys 


590 






595 








600 


He His Arg Lys Leu Gin Thr 


He 


Val 


Asp 


Val 


Gly Leu 


Gly Tyr 


605 






610 








615 


Val Thr Met Gly Gin Pro Ala 


Thr 


Thr 


Leu 


Ser Gly Gly 


Glu 


Ala 


620 






625 








630 


Gin Arg Met Lys Leu Ala Ser 


Glu 


Leu 


His 


Lys 


He Ser 


Asn Gly 


635 






640 








645 


Lys Asn Phe Tyr He Leu Asp 


Glu 


Pro 


Thr 


Thr 


Gly Leu 


His 


Ser 


650 






655 








660 


Asp Asp He Ala Arg Leu Leu 


His 


Val 


Leu 


Gin 


Arg Leu 


Val 


Asp 


665 






670 








675 


Ala Gly Asn Thr Val Leu Val 


He 


Glu 


His 


Asn 


Leu Asp 


Val 


He 


680 






685 








690 


Lys Thr Ala Asp Tyr lie He 


Asp 


Leu 


Gly 


Pro Glu Gly 


Gly Glu 


695 






700 








705 


Gly Gly Gly Thr He Leu Thr 


Thr Gly 


Thr 


Pro 


Glu Glu 


He 


He 


710 






715 








720 


Asn Val Lys Glu Ser Tyr Thr 


Gly His 


Tyr 


Leu 


Lys Lys 


He 


Met 


725 






730 








735 



Val 



<210> 22 

<211> 465 

<212> PRT 

<213> Homo sapiens 

<220> 
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<221> misc_feature 

<223> Incyte ID No: 7476283CD1 

<400> 22 

Met Gly Pro Leu Lys Ala Phe Leu Phe Ser Pro Phe Leu Leu Arg 
15 10 15 

Ser Gin Ser Arg Gly Val Arg Leu Val Phe Leu Leu Leu Thr Leu 
20 25 30 

His Leu Gly Asn Cys Val Asp Lys Ala Asp Asp Glu Asp Asp Glu 
35 40 45 

Asp Leu Lys Val Asn Lys Thr Trp Val Leu Ala Pro Lys lie His 
50 55 60 

Glu Gly Asp He Thr Gin He Leu Asn Ser Leu Leu Gin Gly Tyr 
65 70 75 

Asp Asn Lys Leu Arg Pro Asp He Gly Val Arg Pro Thr Val He 
80 85 90 

Glu Thr Asp Val Tyr Val Asn Ser He Gly Pro Val Asp Pro He 
95 100 105 

Asn Met Glu Tyr Thr He Asp He He Phe Ala Gin Thr Trp Phe 
110 115 120 

Asp Ser Arg Leu Lys Phe Asn Ser Thr Met Lys Val Leu Met Leu 
125 130 135 

Asn Ser Asn Met Val Gly Lys He Trp He Pro Asp Thr Phe Phe 
140 145 150 

Arg Asn Ser Arg Lys Ser Asp Ala His Trp He Thr Thr Pro Asn 
155 160 165 

Arg Leu Leu Arg He Trp Asn Asp Gly Arg Val Leu Tyr Thr Leu 
170 175 180 

Arg Leu Thr He Asn Ala Glu Cys Tyr Leu Gin Leu His Asn Phe 
185 190 195 

Pro Met Asp Glu His Ser Cys Pro Leu Glu Phe Ser Ser Asp Gly 



Glu Val Ala Asp Pro Lys Tyr Trp Arg Leu Tyr Gin Phe Ala Phe 

230 235 240 

Val Gly Leu Arg Asn Ser Thr Glu He Thr His Thr He Ser Gly 

245 250 255 

Asp Tyr Val He Met Thr He Phe Phe Asp Leu Ser Arg Arg Met 

260 265 270 

Gly Tyr Phe Thr He Gin Thr Tyr He Pro Cys He Leu Thr Val 

275 280 285 

Val Leu Ser Trp Val Ser Phe Trp He Asn Lys Asp Ala Val Pro 

290 295 300 

Ala Arg Thr Ser Leu Gly He Thr Thr Val Leu Thr Met Thr Thr 

305 310 315 

Leu Ser Thr He Ala Arg Lys Ser Leu Pro Lys Val Ser Tyr Val 

320 325 330 

Thr Ala Met Asp Leu Phe Val Ser Val Cys Phe He Phe Val Phe 

335 340 345 

Ala Ala Leu Met Glu Tyr Gly Thr Leu His Tyr Phe Thr Ser Asn 

350 355 360 

Gin Lys Gly Lys Thr Ala Thr Lys Asp Arg Lys Leu Lys Asn Lys 

365 370 375 

Ala Ser Met Thr Pro Gly Leu His Pro Gly Ser Thr Leu He Pro 



200 . 

Tyr Pro Lys Asn Glu He 
215 



Glu 



205 

Tyr Lys Trp Lys Lys 
220 



210 

Pro Ser Val 
225 



380 



385 



390 
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Met Asn Asn lie Ser Val Pro Gin Glu Asp Asp Tyr Gly Tyr Gin 

395 400 405 

Cys Leu Glu Gly Lys Asp Cys Ala Ser Phe Phe Cys Cys Phe Glu 

410 415 420 - 

Asp Cys Arg Thr Gly Ser Trp Arg Glu Gly Arg He His He Arg 

425 430 435 

He Ala Lys He Asp Ser Tyr- Ser Arg He Phe Phe Pro Thr Ala 

440 445 450 

Phe Ala Leu Phe Asn Leu Val Tyr Trp Val Gly Tyr Leu Tyr Leu 

455 460 465 



<210> 23 

<211> 235 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_ f eature 

<223> Incyte ID No; 7477105CD1 

<400> 23 



Met 


Gly Ser 


Val Gly 


Ser 


Gin Arg 


Leu 


Glu Glu Pro Ser 


Val Ala 


1 




5 








10 


15 


Gly 


Thr Pro 


Asp Pro 


Gly 


Val Val 


Met 


Ser Phe Thr Phe Asp Ser 






20 








25 


30 


His 


Gin Leu 


Glu Glu 


Ala 


Ala Glu 


Ala 


Ala Gin Gly Gin Gly Leu 






35 








40 


45 


Arg 


Ala Arg 


Gly Val 


Pro 


Ala Phe 


Thr 


Asp Thr Thr Leu 


Asp Glu 






50 








55 


60 


Pro 


Val Pro 


Asp Asp 


Arg 


Tyr His Ala 


He Tyr Phe Ala Met Leu 






65 








70 


75 


Leu 


Ala Gly 


Val Gly 


Phe 


Leu Leu 


Pro 


Tyr Asn Ser Phe 


He Thr 






80 








85 


90 


Asp 


Val Asp 


Tyr Leu 


His 


His Lys 


Tyr 


Pro Gly Thr Ser 


He Val 






95 








100 


105 


Phe 


Asp Met 


Ser Leu 


Thr 


Tyr He 


Leu 


Val Ala Leu Ala 


Ala Val 






110 








115 


120 


Leu 


Leu Asn 


Asn Val 


Leu 


Val Glu Arg 


Leu Thr Leu His 


Thr Arg 






125 








130 


135 


He 


Thr Ala 


Gly Tyr 


Leu 


Leu Ala 


Leu 


Gly Pro Leu Leu 


Phe He 






140 








145 


150 


Ser 


He Cys 


Asp Val 


Trp 


Leu Gin 


Leu 


Phe Ser Arg Asp 


Gin Ala 






155 








160 


165 


Tyr 


Ala He 


Asn Leu 


Ala 


Ala Val 


Gly 


Thr Val Ala Phe Gly Cys 






170 








175 


180 


Thr 


Val Gin 


Gin Ser 


Ser 


Phe Tyr Gly 


His Arg Leu Ala Gin Pro 






185 








190 


195 


Pro 


Pro Gly 


Thr Pro 


Pro 


His Glu 


Leu 


Trp Ser Pro Glu Arg Arg 






200 








205 


210 


Gly 


Ala Ala 


Pro His 


Leu 


Val Thr 


Leu 


Arg Ala Ser Pro 


Ser Val 






215 








220 


225 


Leu 


He Leu 


Arg Asp 


Cys 


Phe Ser 


Gin 


Thr 








230 








235 





<210> 24 
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<211> 662 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7482079CD1 



<400> 24 

Met Leu Lys Gin Ser Glu Arg Arg Arg Ser Trp Ser Tyr Arg Pro 
15 10 15 

Trp Asn Thr Thr Glu Asn Glu Gly Ser Gin His Arg Arg Ser He 
20 25 30 

Cys Ser Leu Gly Ala Arg Ser Gly Ser Gin Ala Ser He His Gly 
35 40 45 

Trp Thr Glu Gly Asn Tyr Asn Tyr Tyr He Glu Glu Asp Glu Asp 
50 55 60 

Gly Glu Glu Glu Asp Gin Trp Lys Asp Asp Leu Ala Glu Glu Asp 
65 70 75 

Gin Gin Ala Gly Glu Val Thr Thr Ala Lys Pro Glu Gly Pro Ser 
80 85 90 

Asp Pro Pro Ala Leu Leu Ser Thr Leu Asn Val Asn Val Gly Gly 
95 100 105 

His Ser Tyr Gin Leu Asp Tyr Cys Glu Leu Ala Gly Phe Pro Lys 

110 115 . 120 

Thr Arg Leu Gly Arg Leu Ala Thr Ser Thr Ser Arg Ser Arg Gin 

125 130 135 

Leu Ser Leu Cys Asp Asp Tyr Glu Glu Gin Thr Asp Glu Tyr Phe 

140 145 150 

Phe Asp Arg Asp Pro Ala Val Phe Gin Leu Val Tyr Asn Phe Tyr 

155 160 165 

Leu Ser Gly Val Leu Leu Val Leu Asp Gly Leu Cys Pro Arg Arg 

170 175 180 

Phe Leu Glu Glu Leu Gly Tyr Trp Gly Val Arg Leu Lys Tyr Thr 

185 190 195 

Pro Arg Cys Cys Arg He Cys Phe Glu Glu Arg Arg Asp Glu Leu 

200 205 210 

Ser Glu Arg Leu Lys He Gin His Glu Leu Arg Ala Gin Ala Gin 

215 220 225 

Val Glu Glu Ala Glu Glu Leu Phe' Arg Asp Met Arg Phe Tyr Gly 

230 235 240 

Pro Gin Arg Arg Arg Leu Trp Asn Leu Met Glu Lys Pro Phe Ser 

245 250 255 

Ser Val Ala Ala Lys Ala He Gly Val Ala Ser Ser Thr Phe Val 

260 265 270 

Leu Val Ser Val Val Ala Leu Ala Leu Asn Thr Val Glu Glu Met 

275 280 285 

Gin Gin His Ser Gly Gin Gly Glu Gly Gly Pro Asp Leu Arg Pro 

290 295 300 

He Leu Glu His Val Glu Met Leu Cys Met Gly Phe Phe Thr Leu 

305 310 315 

Glu Tyr Leu Leu Arg Leu Ala Ser Thr Pro Asp Leu Arg Arg Phe 

320 325 330 

Ala Arg Ser Ala Leu Asn Leu Val Asp Leu Val Ala He Leu Pro 

335 . 340 345 

Leu Tyr Leu Gin Leu Leu Leu Glu Cys Phe Thr Gly Glu Gly His 
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350 355 360 
Gin Arg Gly Gin Thr Val Gly Ser Val Gly Lys Val Gly Gin Val 

365 370 375 

Leu Arg Val Met Arg Leu Met Arg He Phe Arg He Leu Lys Leu 

380 385 390 

Ala Arg His Ser Thr Gly Leu Arg Ala Phe Gly Phe Thr Leu Arg 

395 400 405 

Gin Cys Tyr Gin Gin Val Gly Cys Leu Leu Leu Phe He Ala Met 

410 415 420 

Gly He Phe Thr Phe Ser Ala Ala Val Tyr Ser Val Glu His Asp 

425 430 435 

Val Pro Ser Thr Asn Phe Thr Thr He Pro His Ser Trp Trp Trp 

440 445 450 

Ala Ala Val Ser Thr Phe Ala Leu Gly Phe Pro He Leu Phe Pro 

455 460 465 

Ser Pro Val Ser Cys Ser Ser Leu Pro Trp Leu Ser Ala Thr Arg 

470 475 480 

Leu Trp Leu Leu He Leu Val Phe Pro Pro Thr Pro Asn Arg Arg 

485 490 495 

He Gin Leu Thr Lys Arg Arg Trp Met Ser Lys Val Val Glu Arg 

500 505 510 

Glu Leu Ser Arg Ser Val Asn Ser Ser Ser His Met Ser Met Ala 

515 520 525 

Val Ala Lys Asn Lys Arg Glu Asn Ala Ser Pro He Met Gin Thr 

530 535 540 

Leu His Lys Phe Leu Phe Met Ala Phe Ala Gin Pro He Gly Gin 

545 550 555 

Ser Lys Ser His Gly Gin Ala Ala Ser Gin Arg Ala Gly Gin Val 

560 565 570 

Ser He Ser Thr Val Gly Tyr Gly Asp Met Tyr Pro Glu Thr His 

575 580 585 

Leu Gly Arg Phe Phe Ala Phe Leu Cys He Ala Phe Gly He He 

590 595 600 

Leu Asn Gly Met Pro He Ser He Leu Tyr Asn Lys Phe Ser Asp 

605 610 615 

Tyr Tyr Ser Lys Leu Lys Ala Tyr Glu Tyr Thr Thr He Arg Arg 

620 625 630 

Glu Arg Gly Glu Val Asn Phe Met Gin Arg Ala Arg Lys Lys He 

635 640 645 

Ala Glu Cys Leu Leu Gly Ser Asn Pro Gin Leu Thr Pro Arg Gin 

650 655 660 

Glu Asn 



<210> 25 

<211> 371 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 55145506CD1 

<400> 25 

Met Asn Asp Glu Asp Tyr Ser Thr He Tyr Asp Thr He Gin Asn 
15 10 15 
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Glu Arg Thr Tyr Glu Val Pro Asp Gin Pro Glu Glu Asn Glu Ser 
20 25 30 

Pro His Tyr Asp Asp Val His Glu Tyr Leu Arg Pro Glu Asn Asp 
35 40 45 

Leu Tyr Ala Thr Gin Leu Asn Thr His Glu Tyr Asp Phe Val Ser 
50 55 60 

Val Tyr Thr lie Lys Gly Glu Glu Thr Ser Leu Ala Ser Val Gin 
65 70 75 

Ser Glu Asp Arg Gly Tyr Leu Leu Pro Asp Glu He Tyr Ser Glu 
80 85 ' 90 

Leu Gin Glu Ala His Pro Gly Glu Pro Gin Glu Asp Arg Gly He 
95 100 105 

Ser Met Glu Gly Leu Tyr Ser Ser Ala Gin Asp Gin Gin Leu Cys 

110 115 120 

Ala Ala Glu Leu Gin Glu Asn Gly Ser Val Met Lys Glu Asp Leu 

125 130 135 

Pro Ser Pro Ser Ser Phe Thr He Gin His Ser Lys Ala Phe Ser 

140 145 150 

Thr Thr Lys Tyr Ser Cys Tyr Ser Asp Ala Glu Gly Leu Glu Glu 

155 160 165 

Lys Glu Gly Ala His Met Asn Pro Glu He Tyr Leu Phe Val Lys 

170 175 180 

Ala Gly He Asp Gly Glu Ser He Gly Asn Cys Pro Phe Ser Gin 

185 190 195 

Arg Leu Phe Met He Leu Trp Leu Lys Gly Val Val Phe Asn Val 

200 205 210 

Thr Thr Val Asp Leu Lys Arg Lys Pro Ala Asp Leu His Asn Leu 

215 220 225 

Ala Pro Gly Thr His Pro Pro Phe Leu Thr Phe Asn Gly Asp Val 

230 235 240 

Lys Thr Asp Val Asn Lys He Glu Glu Phe Leu Glu Glu Thr Leu 

245 250 255 

Thr Pro Glu Lys Tyr Pro Lys Leu Ala Ala Lys His Arg Glu Ser 

260 265 270 

Asn Thr Ala Gly He Asp He Phe Ser Lys Phe Ser Ala Tyr lie 

275 280 285 

Lys Asn Thr Lys Gin Gin Asn Asn Ala Ala Leu Glu Arg Gly Leu 

290 295 300 

Thr Lys Ala Leu Lys Lys Leu Asp Asp Tyr Leu Asn Thr Pro Leu 

305 310 315 

Pro Glu Glu He Asp Ala Asn Thr Cys Gly Glu Asp Lys Gly Ser 

320 325 330 

Arg Arg Lys Phe Leu Asp Gly Asp Glu Leu Thr Leu Ala Asp Cys 

335 340 345 

Asn Leu Leu Pro Lys Leu His Val Val Lys Thr His Leu Leu Thr 

350 355 360 

Ser Ser Ser Asn Phe Leu Arg Asn Lys Tyr His 

365 370 

<210> 26 

<211> 468 

<212> PRT 

<213> Homo sapiens 

<220> 
. <221> misc__f eature 



45/71 



WO 02/22684 



PCT/US01/28938 



<223> Incyte ID No: 5950519CD1 
<400> 26 

Met Arg Gly Ser Pro Gly Asp Ala Glu Arg Arg Gin Arg Trp Gly 
1 5 10 15 

Arg Leu Phe Glu Glu Leu Asp Ser Asn Lys Asp Gly Arg Val Asp 
20 25 30 

Val His Glu Leu Arg Gin Gly Leu Ala Arg Leu Gly Gly Gly Asn 
35 40 45 

Pro Asp Pro Gly Ala Gin Gin Gly lie Ser Ser Glu Gly Asp Ala 
50 55 60 

Asp Pro Asp Gly Gly Leu Asp Leu Glu Glu Phe Ser Arg Tyr Leu 
65 70 75 

Gin Glu Arg Glu Gin Arg Leu Leu Leu Met Phe His Ser Leu Asp 
80 85 90 

Arg Asn Gin Asp Gly His lie Asp Val Ser Glu lie Gin Gin Ser 
95 100 105 

Phe Arg Ala Leu Gly lie Ser He Ser Leu Glu Gin Ala Glu Lys 

110 115 120 

He Leu His Ser Met Asp Arg Asp Gly Thr Met Thr He Asp Trp 

125 130 135 

Gin Glu Trp Arg Asp His Phe Leu Leu His Ser Leu Glu Asn Val 

140 145 150 

Glu Asp Val Leu Tyr Phe Trp Lys His Ser Thr Val Leu Asp He 

155 160 165 

Gly Glu Cys Leu Thr Val Pro Asp Glu Phe Ser Lys Gin Glu Lys 

170 • 175 180 

Leu Thr Gly Met Trp Trp Lys Gin Leu Val Ala Gly Ala Val Ala 

185 190 - 195 

Gly Ala Val Ser Arg Thr Gly Thr Ala Pro Leu Asp Arg Leu Lys 

200 205 210 

Val Phe Met Gin Val His Ala Ser Lys Thr Asn Arg Leu Asn He 

215 220 225 

Leu Gly Gly Leu Arg Ser Met Val Leu Glu Gly Gly He Arg Ser 

230 235 240 

Leu Trp Arg Gly Asn Gly He Asn Val Leu Lys He Ala Pro Glu 

245 250 255 

Ser Ala He Lys Phe Met Ala Tyr Glu Gin He Lys Arg Ala He 

260 265 270 

Leu Gly Gin Gin Glu Thr Leu His Val Gin Glu Arg Phe Val Ala 

275 280 285 

Gly Ser Leu Ala Gly Ala Thr Ala Gin Thr He He Tyr Pro Met 

290 295 300 

Glu Val Leu Lys Thr Arg Leu Thr Leu Arg Arg Thr Gly Gin Tyr 

.305 310 315 

Lys Gly Leu Leu Asp Cys Ala Arg Arg He Leu Glu Arg Glu Gly 

320 325 330 

Pro Arg Ala Phe Tyr Arg Gly Tyr Leu Pro Asn Val Leu Gly He 

335 340 345 

He Pro Tyr Ala Gly He Asp Leu Ala Val Tyr Glu Thr Leu Lys 

350 355 360 

Asn Trp Trp Leu Gin Gin Tyr Ser His Asp Ser Ala Asp Pro Gly 

365 370 375 

He Leu Val Leu Leu Ala Cys Gly Thr He Ser Ser Thr Cys Gly 

380 385 390 

Gin He Ala Ser Tyr Pro Leu Ala Leu Val Arg Thr Arg Met Gin 
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Ann 




Ala 


Gin Ala 


OCX 


IXC 


(Z"\ n CZl \r CZ~\ \r Prr» CZl n T.oii 

UlU ""'"*• J; VJXjf x X U VJXI1 XlCU 


Co>- Mot* T^en fil vr 

OCX UsU XJCIL VJlj 










415 


420 


Leu 


Leu Arg 


His 


lie 


Leu Ser Gin Glu Glv Met 


Aro Glv Leu Tvr 








425 


430 


435 


Arg Gly lie 


Ala 


Pro 


Asn Phe Met Lys Val He 


Pro Ala Val Ser 








440 


445 


450 


He 


Ser Tyr 


Val 


Val 


Tyr Glu Asn Met Lys Gin 


Ala Leu Gly Val 








455 


460 


465 


Thr 


Ser Arg 











<210> 27 

<211> 2229 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1687189CB1 

<400> 27 

gcctgagcgg ccgaactcgg cagctccaac 
ccagtccaag actctgtgct ccctaggttt 
cctgctagga tgccgtggca agcatttcgc 
acactggagt caggcatggc tgagactcgc 
gtggccctgg gtgtgggcag cacattgggt 
gccaaagata aagcagggcc atccattgtg 
gtgttggctg ggctgtgcta tgcggagttt 
tatctctaca gctatgtcac tgtgggtgaa 
atcctctcct atgtcattgg tacagccagt 
aacctgattg ggaaccacat ctctaagact 
catgtccttg cagaatatcc agatttcttt 
ttgttggctc tcggggctag tgagtcggcc 
cttttggttc ttgggttcgt catgatctct 
aagctcacag aagaggacta cgaattggcc 
ggtcctctgg gctctggagg atttgtgcct 
gcgacctgtt tctatgcatt tgttggtttc 
cagaatcccc agcgttccat cccgatgggc 
gcgtattttg ctgtctcttc tgcactcacc 
gagagccctt tgcctgaggc atttctctac 
gctgttggct ccctctgtgc tctttctacc 
cgggtgatct acgcgatggc agaggatggc 
accggcacac gcaccccaat catagccacc 
gcattcctct tcaaactcac tgatcttgtg 
tactccctgg tgtcgatttg tgttctcatc 
actggggaag aagtggagtt gcaggaggag 
ctatggggac tatttttccc actcaactcc 
tatgtttgtt cctcattgct tgctgtcctg 
tggtcagttc cattgctttc tggagacctg 
ctgctcatta ttgggatcat tgtggtcatc 
cactttaagg tgcctgcttt gcctctcctc 
cttatgatgc agatgacagc tggtacctgg 
tttgctatct acttcggcta tgggatccag 
ccctcacgca agtctagagc caaaactgta 
tcagtttgac atcgtcacac ctaaatgctg 



ccaactcggc ttaactccgc ctcaccgagc 60 
gcaacagctc tctgatcatc ttcttcaatt 120 
agatttggtc aaaagctggt acgcagacgt 180 
cttgccagat gcctaagcac cctggattta 240 
gcaggcgtgt atgtcctagc tggcgaggtg 300 
atctgctttt tggtggctgc cctgtcttct 360 
ggtgcccggg ttccccgttc tggttcggca 420 
ctctgggcct tcaccactgg ctggaacctc 480 
gtggcccggg cctggagctc tgcttttgac 540 
ctgcaggggt ccattgcact gcacgtgccc 600 
gctttgggcc tcgtgttgct gctcactgga 660 
ctggttacca aagtgttcac aggcgtgaac 720 
ggcttcgtta agggggacgt gcacaactgg 780 
atggctgaac tcaatgacac ctatagcttg 840 
ttcggcttcg agggaattct ccgtggagca 900 
gactgtattg ctaccactgg agaagaagcc 960 
attgtgatct cactgtctgt ctgctttttg 1020 
ctgatgatgc cttactacca gcttcagcct 1080 
attggatggg ctcctgcccg ctatgttgtg 1140 
agcctcctgg gctccatgtt ccccatgcct 1200 
ctcctgttcc gtgtacttgc tcggatccac 1260 
gtggtctctg gcattattgc agcattcatg 1320 
gacctcatgt caattgggac cctgcttgct 1380 
ctcaggtatc aacctgatca ggagacaaag 1440 
gcaataacta ctgaatcaga gaagttgacc 1500 
atccccactc cactctctgg- ccaaattgtc 1560 
ctgactgctc tttgcctggt gctggcccag 1620 
ctgtggactg cagtggttgt gctgctcctg 1680 
tggagacagc cacagagctc cactcccctt 1740 
ccactaatga gcatctttgt gaatatttac 1800 
gcccgatttg gggtctggat gctgattggc 1860 
cacagcctgg aagagattaa gagtaaccaa 1920 
gaccttgatc ccggcactct ctatgtccac 1980 
tctggtcccc tgcacaataa tggagagtac 2040 
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tcctgacccc agtgacagct agccctcccc 
tctgtacgat gtgaaggatg tgtctttgct 
taaatgatgt ctagctgctt accaacttta 
aaaaaaaaa 



PCT/US01/28938 

tgtgatggtg gtggtggata ctaatacagt 2100 
atttcttgtc tattttaacc cgtctgcttc 2160 
aaaaatgata ttaaaagaaa gtagaaaaat 2220 

2229 



<210> 28 

<211> 7610 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 70782 07CB1 

<400> 28 

gcgccccgcc cccgcgcggg cgatgcccag cggcgcggcg ggctgcgggg cccggcgggg 60 
cgcgcagagg agcgggccgc ggcgctgagg cggcggagcg tggccccgcc atgggcttcc 120 
tgcaccagct gcagctgctg ctctggaaga acgtgacgct caaacgccgg agcccgtggg 180 
tcctggcctt cgagatcttc atccccctgg tgctgttctt tatcctgctg gggctgcgac 240 
agaagaagcc caccatctcc gtgaaggaag tctccttcta cacagcggcg cccctgacgt 300 
ctgccggcat cctgcctgtc atgcaatcgc tgtgcccgga cggccagcga gacgagttcg 360 
gcttcctgca gtacgccaac tccacggtca cgcagctgct tgagcgcctg gaccgcgtgg 420 
tggaggaagg caacctgttt gacccagcgc ggcccagcct gggctcagag ctcgaggccc 480 
tacgccagca tctggaggcc ctcagtgcgg gcccgggcac ctcggggagc cacctggaca 540 
gatccacagt gtcttccttc tctctggact cggtggccag aaacccgcag gagctctggc 600 
gtttcctgac gcaaaacttg tcgctgccca atagcacggc ccaagcactc ttggccgccc 660 
gtgtggaccc gcccgaggtc taccacctgc tctttggtcc ctcatctgcc ctggattcac 720 
agtctggcct ccacaagggt caggagccct ggagccgcct agggggcaat cccctgttcc 780 
ggatggagga gctgctgctg gctcctgccc tcctggagca gctcacctgc acgccgggct 840 
cgggggagct gggccggatc ctcactgtgc ctgagagtca gaagggagcc ctgcagggct 900 
accgggatgc tgtctgcagt gggcaggctg ctgcgcgtgc caggcgcttc tctgggctgt 960 
ctgctgagct ccggaaccag ctggacgtgg ccaaggtctc ccagcagctg ggcctggatg 1020 
cccccaacgg ctcggactcc tcgccacagg cgccaccccc acggaggctg caggcgcttc 1080 
tgggggacct gctggatgcc cagaaggttc tgcaggatgt ggatgtcctg tcggccctgg 1140 
ccctgctact gccccagggt gcctgcactg gccggacccc cggaccccca gccagtggtg 1200 
cgggtggggc ggccaatggc actggggcag gggcagtcat gggccccaac gccaccgctg 1260 
aggagggcgc accctctgct gcagcactgg ccaccccgga cacgctgcag ggccagtgct 1320 
cagccttcgt acagctctgg gccggcctgc agcccatctt gtgtggcaac aaccgcacca 1380 
ttgaacccga ggcgctgcgg cggggcaaca tgagctccct gggcttcacg agcaaggagc 1440 
agcggaacct gggcctcctc gtgcacctca tgaccagcaa ccccaaaatc ctgtacgcgc 1500 
ctgcgggctc tgaggtcgac cgcgtcatcc tcaaggccaa cgagactttt gcttttgtgg 1560 
gcaacgtgac tcactatgcc caggtctggc tcaacatctc ggcggagatc cgcagcttcc 1620 
tggagcaggg caggctgcag caacacctgc gctggctgca gcagtatgta gcagagctgc 1680 
ggctgcaccc cgaggcactg aacctgtcac tggatgagct gccgccggcc ctgagacagg 1740 
acaacttctc gctgcccagt ggcatggccc tcctgcagca gctggatacc attgacaacg 1800 
cggcctgcgg ctggatccag ttcatgtcca aggtgagcgt ggacatcttc aagggcttcc 1860 
ccgacgagga gagcattgtc aactacaccc tcaaccaggc ctaccaggac aacgtcactg 1920 
tttttgccag tgtgatcttc cagacccgga aggacggctc gctcccgcct cacgtgcact 1980 
acaagatccg ccagaactcc agcttcaccg agaaaaccaa cgagatccgc cgcgcctact 2040 
ggcggcctgg gcccaatact ggcggccgct tctacttcct ctacggcttc gtctggatcc 2100 
aggacatgat ggagcgcgcc atcatcgaca cttttgtggg gcacgacgtg gtggagccag 2160 
gcagctacgt gcagatgttc ccctacccct gctacacacg cgatgacttc ctgtttgtca 2220 
ttgagcacat gatgccgctg tgcatggtga tctcctgggt ctactccgtg gccatgacca 2280 
tccagcacat cgtggcggag aaggagcacc ggctcaagga ggtgatgaag accatgggcc 2340 
tgaacaacgc ggtgcactgg gtggcctggt tcatcaccgg ctttgtgcag ctgtccatct 2400 
ccgtgacagc actcaccgcc atcctgaagt acggccaggt gcttatgcac agccacgtgg 2460 
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tcatcatctg gctcttcctg gcagtctacg 
tgtctgtgct gtactccaag gccaagctgg 
tgagctacgt gccctacatg tacgtggcga 
cggccttcga gaagtgcatc gcgtccctca 
agtacttcgc gctgtatgag gtggccggcg 
ccccggtgga gggggacgac ttcaacttgc 
ccgtggtcta tggcatcctc acgtggtaca 
tgccccggcc ctggtacttc ccactgcaga 
aagcctggga gtggagctgg ccgtgggcac 
accaggcctg tgccatggag agccggcgct 
ccacccacct gcctctggtt gtctgcgtgg 
agaagctggc cctgaacaag ctgagcctga 
tgggccacaa cggggcgggc aagaccacca 
caacgtcggg ttccgccacc atctacgggc 
gcaagaacct gggcatgtgc ccgcagcaca 
aacacctctg gttctactca cggctcaaga 
tggacaagat gatcgaggac ctggagctct 
tgtcgggtgg catgaagcgc aagctgtccg 
ccatcatcct ggacgagccc acggcgggcg 
acctcatcct gaagtacaag ccaggccgca 
aggctgacct gcttggggac cgcattgcca 
gctccccgct cttcctcaag ggcacctatg 
ggcccgccga gccggggggc ccccaagagc 
ccccgctgag cage tgc tec gagctccagg 
cctgcctgct ggtctcagac acaagcaegg 
ccaagaaggg ggctttcgag cgcctcttcc 
acctcagcag ettegggctg atggacacga 
aggaggatca gtcgctggag aacagtgagg 
tccctggggc ggagggcccg gcgtctgggg 
eggagctgae ccagtcgcag gcatcgctgc 
gcgacgaggg agctggctac accgacgtct 
cacaggaccc agacaatgtc agectgeaag 
gecagggcag ccgcaagctg gaeggegggt 
tggtcaaacg cttccactgc gcccgccgca 
tgccagcctt cttcgtctgc gtggccatga 
atctgccccc gctggtcctg tcaccttccc 
atttcatccc ctacgccaac gaggagegee 
ccagccccca geagctegtg agcacgttcc 
tgctcaagtc tcccgccaac ggctcgctgg 
cgcgcctgct ggeggctegg ttcttcgaca 
tgccactgtc caatttegtg ccacccccac 
ccccggatga ggacctgcag gcctggaacg 
tgtggacgtc ggcaccctcc ctgccgcgcc 
ctgcgcaggg caccggcttc tcctgcccca 
gggtggtcac aggegacate ctgaccgaca 
tcttcacctc cgaccgcttc cgactgcacc 
tgaagtccat cccagcctca tttggcacca 
tgegcaggge tgcccaggtt ttctacaaca 
tcaacagcct caacaacgcc atectgegtg 
eggcttaegg catcaccgtc accaaccacc 
tggattacct getgeaggge aeggatgteg 
ccttcgtgcc ggccagcttc gttgtcttcc 
atetgeagtt tgtcagegge tgcaacccca 
acatgetcaa ctacctggtc cccgctacct 
tgccggccta cacgtcgccc accaacttcc 
ggtggtccat cacgcccatc atgtacccgg 



cggtggccac catcatgttc tgcttcctgg 2520 
cctcggcctg cggtggcatc atctacttcc 2580 
tccgagagga ggtggcgcat gataagatca 2640 
tgtccacgac ggcctttggt ctgggctcta 2700 
tgggcatcca gtggcacacc ttcagccagt 2760 
tcctggctgt caccatgctg atggtggacg 2820 
ttgaggctgt gcacccaggc atgtacgggc 2880 
agtcctactg gctgggcagt gggcggacag 2940 
gcaccccccg cctcagtgtc atggaggagg 3000 
ttgaggagac ccgtggcatg gaggaggagc 3060 
acaaactcac caaggtctac aaggacgaca 3120 
acctctacga gaaccaggtg gtctccttct 3180 
ccatgtccat cctgaccggc ctgttccctc 3240 
acgacatccg caeggagatg gatgagatcc 3300 
atgtgctctt tgaccggctc acggtggagg 3360 
gcatggctca ggaggagatc cgcagagaga 3420 
ccaacaaacg gcactcactg gtgeagacat 3480 
tggccatcgc cttcgtgggc ggctctcgcg 3540 
tggaccccta cgcgcgccgc gccatctggg 3600 
ccatccttct gtccacccac cacatggatg 3660 
tcatctccca tgggaagctc aagtgctgcg 3720 
gcgacgggta ccgcctcacg ctggtcaagc 3780 
cagggctggc atccagcccc ecaggteggg 3840 
tgtcccagtt catccgcaag catgtggcct 3900 
agctctccta catcctgccc agegaggecg 3960 
agcacctgga gcgcagcctg gatgeactge 4020 
ccctggagga agtgttcctc aaggtgtcgg 4080 
ccgatgtgaa ggagtccagg aaggatgtgc 4140 
agggtcaege tggcaatctg gcccggtgct 4200 
agtcggcgtc atctgtgggc tctgcccgtg 4260 
atggcgacta ccgccccctc tttgataacc 4320 
aggtggaggc agaggecctg tcgagggtcg 4380 
ggctgaaggt gcgccagttc cacgggctgc 4440 
actccaaggc actcttctcc cagatcttgc 4500 
ccgtggccct gtccgtcccg gagattggtg 4560 
agtaccacaa ctacacccag ccccgtggca 4620 
gcgagtaccg getgeggcta tcgcccgacg 4680 
ggctgccgtc gggggtgggt gccacctgcg 4740 
ggcccacgtt gaacctgagc ageggggagt 4800 
gcatgtgtct ggagtccttc acacaggggc 4860 
cc tcgcccgc cccatctgac tcgccagcgt 4920 
tctccctgcc gcccaccgct gggccagaaa 4 980 
tggtacggga gcccgtccgc tgcacctgct 5040 
gcagtgtggg cgggcacccg ccccagatgc 5100 
tcaccggcca caatgtctct gagtacctgc 5160 
ggtatggggc catcaccttt ggaaaegtec 5220 
gggccccacc catggtgcgg aagatcgegg 5280 
acaagggcta tcacagcatg cccacctacc 5340 
ccaacctgcc caagagcaag ggcaacccgg 5400 
ccatgaataa gaccagcgcc agcctctcgc 5460 
tcatcgccat cttcatcatc gtggccatgt 5520 
tcgtggccga gaagtccacc aaggecaage 5580 
tcatctactg getggegaac tacgtgtggg 5640 
gctgtgtcat catcctgttt gtgttcgacc 5700 
ctgccgtcct ctccctcttc ctgctctatg 5760 
cctccttctg gttcgaggtc cccagctccg 5820 
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cctacgtgtt cctcattgtc atcaatctct tcatcggcat caccgccacc gtggccacct 5880 
tcctgctaca gctcttcgag cacgacaagg acctgaaggt tgtcaacagt tacctgaaaa 5940 
gctgcttcct cattttcccc aactacaacc. tgggccacgg gctcatggag atggcctaca 6000 
acgagtacat caacgagtac tacgccaaga ttggccagtt tgacaagatg aagtccccgt 6060 
tcgagtggga cattgtcacc cgcggactgg tggccatggc ggttgagggc gtcgtgggct 6120 
tcctcctgac catcatgtgc cagtacaact tcctgcggcg gccacagcgc atgcctgtgt 6180 
ctaccaagcc tgtggaggat gatgtggacg tggccagtga gcggcagcga gtgctccggg 6240 
gagacgccga caatgacatg gtcaagattg agaacctgac caaggtctac aagtcccgga 6300 
agattggccg tatcctggcc gttgaccgcc tgtgcctggg tgtgcgtcct ggcgagtgct 6360 
tcgggctcct gggcgtcaac ggtgcgggca agaccagcac cttcaagatg ctgaccggcg 6420 
acgagagcac gacggggggc gaggccttcg tcaatggaca cagcgtgctg aaggagctgc 6480 
tccaggtgca gcagagcctc ggctactgcc cgcagtgtga cgcgctgttc gacgagctca 6540 
cggcccggga gcacctgcag ctgtacacgc ggctgcgtgg gatctcctgg aaggacgagg 6600 
cccgggtggt gaagtgggct ctggagaagc tggagctgac caagtacgca gacaagccgg 6660 
ctggcaccta cagcggcggc aacaagcgga agctctccac ggccatcgcc ctcattgggt 6720 
acccagcctt catcttcctg gacgagccca ccacaggcat ggaccccaag gcccggcgct 6780 
tcctctggaa cctcatcctc gacctcatca agacagggcg ttcagtggtg ctgacatcac 6840 
acagcatgga ggagtgcgag gcgctgtgca cgcggctggc catcatggtg aacggtcgcc 6900 
tgcggtgcct gggcagcatc cagcacctga agaaccggtt tggagatggc tacatgatca 6960 
cggtgcggac caagagcagc cagagtgtga aggacgtggt gcggttcttc aaccgcaact 7020 
tcccggaagc catgctcaag gagcggcacc acacaaaggt gcagtaccag ctcaagtcgg 7080 
agcacatctc gctggcccag gtgttcagca agatggagca ggtgtctggc gtgctgggca 7140 
tcgaggacta ctcggtcagc cagaccacac tggacaatgt gttcgtgaac tttgccaaga 7200 
agcagagtga caacctggag cagcaggaga cggagccgcc atccgcactg cagtcccctc 7260 
tcggctgctt gctcagcctg ctccggcccc ggtctgcccc cacggagctc cgggcacttg 7320 
tggcagacga gcccgaggac ctggacacgg aggacgaggg cctcatcagc ttcgaggagg 7380 
agcgggccca gctgtccttc aacacggaca cgctctgctg accacccaga gctgggccag 7440 
ggaggacacg ctccactgac cacccagagc tgggccaggg actcaacaat ggggacagaa 7500 
gtcccccagt gcctgccagg gcctggagtg gaggttcagg accaaggggc ttctggtcct 7560 
ccagcccctg tactcggcca tgtcctgcgg tcactgcggt tgccggccct 7610 

<210> 29 

<211> 2219 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 156061&CB1 

<400> 29 

ggcagcatga gccgatcacc cctcaatccc agccaactcc gatcagtggg ctcccaggat 60 

gccctggccc ccttgcctcc acctgctccc cagaatccct ccacccactc ttgggaccct 120 

ttgtgtggat ctctgccttg gggcctcagc tgtcttctgg ctctgcagca tgtcttggtc 180 

atggcttctc tgctctgtgt ctcccacctg ctcctgcttt gcagtctctc cccaggagga 240 

ctctcttact ccccttctca gctcctggcc tccagcttct tttcatgtgg tatgtctacc 300 

atcctgcaaa cttggatggg cagcaggctg cctcttgtcc aggctccatc cttagagttc 360 

cttatccctg ctctggtgct gaccagccag aagctacccc gggccatcca gacacctgga 420 

aactcctccc tcatgctgca cctttgtagg ggacctagct gccatggcct ggggcactgg 480 

aacacttctc tccaggaggt gtccggggca gtggtagtat ctgggctgct gcagggcatg 540 

atggggctgc tggggagtcc cggccacgtg ttcccccact gtgggcccct ggtgctggct 600 

cccagcctgg ttgtggcagg gctctctgcc cacagggagg tagcccagtt ctgcttcaca 660 

cactgggggt tggccttgct ggttatcctg ctcatggtgg tctgttctca gcacctgggc 720 

tcctgccagt ttcatgtgtg cccctggagg cgagcttcaa cgtcatcaac tcacactcct 780 

ctccctgtct tccggctcct ttcggtgctg atcccagtgg cctgtgtgtg gattgtttct 840 

gcctttgtgg gattcagtgt tatcccccag gaactgtctg cccccaccaa ggcaccatgg 900 
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atttggctgc ctcacccagg tgagtggaat 
gcaggcatct ccatggcctt ggcagcctcc 
ggccggctgc tgcatttgcc tcccccacct 
gaggggctgg gcagtgtgct ggccgggctg 
ttccccaacg tgggcaaagt gggtcttatc 
gtggggctac tctgcgtggg gcttggactc 
atcccactgc ctgttgttgg tggggtgctg 
ggattctcca gcttctacct ggctgacata 
ttctccatct tcatggcctt gctgctgcca 
agcacaggct ggagcccctt ggatgtatta 
ctggctggac tctcaggctt cctactagag 
ggcctaggtc aagggctacc atctcctttc 
cccagggaga aggctgctca agtgtacaga 
tgcatccccc agcctctcca ctgcctctgc 
ggaggctcct ctgagccaga agagatggca 
cctgaatcta gcagagaagg gtttaggtcc 
tggttaattt agccctaact ctcatctgct 
gtaggcagag gatatgtgtg tgtgtattac 
tagggtgggt tgcctttcct tgtcttaatt 
atttagtgga ccatgaatga ttgagatttt 
cagcattcac tttaatactt actgagcatc 
ataagacagg gtccatgccg tctcaaggca 



tggcctttgc tgacgcccag agctctggct 960 
accagttccc tgggctgcta tgccctgtgt 1020 
ccacatgcct gcagtcgagg gctgagcctg 1080 
ctgggaagcc ccatgggcac tgcatccagc 1140 
caggctggat ctcagcaagt ggctcactta 1200 
tcccccaggt tggctcagct cctcaccacc 1260 
ggggtgaccc aggctgtggt tttgtctgct 1320 
gactctgggc gaaatatctt cattgtgggc 1380 
agatggtttc gggaagcccc agtcctgttc 1440 
ctgcactcac tgctgacaca gcccatcttc 1500 
aacacgattc ctggcacaca gcttgagcga 1560 
actgcccaag aggctcgaat gcctcagaag 1620 
cttcctttcc ccatccaaaa cctctgtccc 1680 
ccactgcctg aagaccctgg ggatgaggaa 1740 
gacttgctgc ctggctcagg ggagccatgc 1800 
cagaaatgac cagaacgcct acttctgccc 1860 
ggagagtcag ctcccaaact gttctttctt 1920 
atgggactgt ctagaggttc catttcccaa 1980 
aggcctaact gttccagagc agaggccatg 2040 
gcctgtgtac tatcaatgcc acttgaacca 2100 
tcccatgtgc aaggtcctgg aactacaggg 2160 
tttacggttt aaaaagacct ttgtaatta 2219 



<210> 30 

<211> 1280 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2614283CB1 

<400> 30 

ccgcaggagc cgggccggag tgagcgcacc 
cgccacccgg agtcccgcgc gcaactttca 
gccgggactc ggcgcgggac tgcatggagg 
ggccggcaat ccggtcatac acgggatctc 
tgccccgacc tggcctggac ttgcaggcca 
gccatgctca gaagggtcct gacagtcact 
agctgtatgt agcctctgcc atctgcctgt 
acctggcaca cagcttggct gtcatgactg 
gcatgctcat cagcctcttc tccctctgga 
actttggctg gcagagagct gagatcttgg 
tcgtgacggg ggtactggtg tacctggctg 
ttgacggggg gaccatgctg atcacgtcgg 
ggttgaccct tcaccagtct ggccatgggc 
agaaccccag cgtccgagct gccttcatcc 
gtgtcctagt ggcagcctat attttatact 
tctgcacctt cgtcttctcc atcctggtcc 
tgatcctggt gttgatggaa gggaccccca 
tgctgctgtc ggtggagggg gtagaagccc 
tggcccagcc tgttctgtct gtccacatcg 
tgctgaagac agccagcagc cgcctccaag 
agatcgagga ctactcggag gacatgaagg 
gactgctcag ccaggcacca 



tcgcggggcc cctcggggca ggtgggtgag 60 
gggcgcactc ggcggggcgg ctgcgcggct 120 
ccaaggagaa gcagcatctg ttggacgcca 180 
tgtggcagga aggggctggc tggattcctc 240 
ttgagctggc tgcccagagc aaccatcact 300 
gtgaccccaa gaaggggaag gcccagcgcc 360 
tgttcatgat cggagaagtc gttggtgggt 420 
acgcagcaca cctgctcact gactttgcca 480 
tgtcctcccg gccagccacc aagaccatga 540 
gagccctggt ctctgtactg tccatctggg 600 
tggagcggct gatctctggg gactatgaaa 660 
gctgcgctgt ggctgtgaac atcataatgg 720 
acagccacgg caccaccaac cagcaggagg 780 
atgtgatcgg cgactttatg cagagcatgg 840 
tcaagccaga atacaagtat gtagacccca 900 
tggggacaac cttgaccatc ctgagagatg 960 
agggcgttga cttcacagct gttcgtgatc 1020 
tgcacagcct gcatatctgg gcactgacgg 1080 
ccattgctca gaatacagac gcccaggctg 1140 
ggaagttcca cttccacacc gtgaccatcc 1200 
actgtcaggc atgccagggc ccctcagact 1260 

1280 
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<210> 31 

<211> 2727 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2667691CB1 

<400> 31 

cagtagtggt gggacggcac tagctgctgg 
cgccaggaat cgaggatggt aaaatgaccc 
gcagtatcat gctggccaag aagatcatca 
gttctcctag tgtctatcat gcagttatcg 
tattgacagc acccaccttg gtggtattac 
tgaacggctt aattcaagga gtaaagggtt 
gtgctctttc tgatgtttgg ggccgaaaat 
gtgccccaat tcctttaatg aagatcagcc 
ctggggtttt tgcagtgact ttttctgtgg 
agcatgaaag aagtatggct tatggactgg 
ccagtcctgc aattggagct tatcttggac 
tagctacagc aatagctttg ctagatattt 
tgcctgagaa aatgcggcca gcatcctggg 
cttttgcgtc cttaaaaaaa gtcggccaag 
tgtttctctc ctacctaccg gaggcaggcc 
agataatgaa attttcacca gaaagtgttg 
ccattattgc acagaccata gtcttgagtt 
ccattttact gggtctagga tttcaaatat 
aaccttggat gatgtgggct gctggggcag 
ctgtcagtgc acttgtttca cgaactgctg 
tgataacagg aattcgagga ttatgcaatg 
tctacatatt ccatgtggaa cttaaagaac 
acacaagccc tcagcaccac tttgaacaga 
ttggagcctg ttcagtactg ctggctctgc 
atttaagctt aaggtccagc agttggagaa 
atacacaagc gccaggagag gccaaagaac 
tgaaatcagg aagatttttc tatcagcacc 
tgtacattcc atttccatcc acagtgtact 
gaactccgtg ggaactaaag gaagtgggaa 
acaatttctt ttgaaaaacc ttttgtttat 
gttttattat acatccttta attaaaaact 
gtctctgcta ccatttcctt aaggtgttga 
cagtaggtag tatggttgtg gacctatttg 
ttaatattgt aaaatcttgg gtcaaataat 
aagaaatggt aaatgaattg tttgcattta 
aatctgaatc atgttttgag cttgtttgca 
tgaagtgaga aagtatcagc catttagcat 
aaatgctggc tcaaatttat tccccagcta 
cataatcata agcacctcaa cacttgaata 
tagccttgaa aatgtcagtg tgatatctat 
aggactgtca cagtaacact ttatttacag 
ccagaggaga gctttataca attaccgatg 
ccagttatat tatttaaaaa agtgttactt 
tttacactgt tttcatcttg tgtgtggtta 
cactatggtt aaataaggtc ttaaaagaaa 
tttaatgatt ttttttttaa aaaaaaa 



ggcctgccgc cccgggagtg gctgcagcag 60 
aggggaagaa gaagaaacgg gccgcgaacc 120 
ttaaggacgg aggcacgcct caaggaatag 180 
tcatcttttt ggagtttttt gcttggggac 240 
atgaaacctt tcctaaacat acatttctga 300 
tgttgtcatt ccttagtgcc ccgcttattg 360 
ccttcttgct gctaacggtg tttttcacat 420 
catggtggta ctttgctgtt atctctgttt 480 
tatttgcata cgtagcagat ataacccaag 540 
tttcagcaac atttgctgca agtttagtca 600 
gagtatatgg ggacagcttg gtggtggtct 660 
gttttatcct tgttgctgtg ccagagtcgt 720 
gagcacccat ttcctgggaa caagctgacc 780 
attccatagt gctgctgatc tgcattacag 840 
aatattccag ctttttttta tacctcagac 900 
cagcgtttat agcagtcctt ggcattcttt 960 
tacttatgag gtcaattgga aataagaaca 1020 
tacagttggc atggtatggc tttggttcag 1080 
tagcagccat gtctagcatc acctttcctg 1140 
atgctgatca acagggtgtc gttcaaggaa 1200 
gtctgggacc ggccctctat ggattcattt 1260 
tgccaataac aggaacagac ttgggaacaa 1320 
attccatcat ccctggccct cccttcctat 1380 
ttgttgcctt gtttattccg gaacatacca 1440 
agcactgtgg cagtcacagc catcctcata 1500 
ctttactcca ggacacaaat gtgtgacgac 1560 
caggtcttag ttttcacctc tagttctgga 1620 
ttaagattgt cttaagaaat gtatctgcat 1680 
cttagaacca gacagttttc caaagatgtt 1740 
tagcaccaat ttcttgccac taagctattt 1800 
atatatgtaa cttcttagat attagcaaat 1860 
gctttaactc tatgctgact cagtgagaca 1920 
ttttaacatt gtaaaatttt gagtcagatt 1980 
tcaaagcctt aatgcagatg cactaaaaca 2040 
aaaaaaaaaa ctcttaagaa aactgtacta 2100 
gtacttttaa acattattca ctactgtttt 2160 
ttaagttggg gtatttagag cctgtaatct 2220 
cttcttatac cactattctt ttaatgtttg 2280. 
cataatctaa aaattatata gtaaagctgg 2340 
tatgtagata aatatatata gtggcctttc 2400 
agctaatgtt tgtcctaaat tttcaggacc 2460 
tgaatttctc taaagtgtat atttttgtgt 2520 
tgtaaaaatt gtatataaag aactgtatag 2580 
ttgcttaatg ctttttaaac ttggaacact 2640 
tgtaaatatt ctgttaataa agttaaatat 2700 

2727 



52/71 



WO 02/22684 



PCT7US01/28938 



<210> 32 
<211> 1631 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3211415CB1 



<400> 32 

ttgcgcttga atgttcgttg actggcccgt cggttttaca aggcccggac aagcgctggg 60 

gattcccgtt tgaggcgtca ctactgtcac tgccatcacc ccacggagcc acttctagag 120 

gggagtagac ccggcccttc gccgggcaga gaagatgttg cccctgtcca tcaaagacga 180 

tgaatacaaa ccacccaagt tcaatttgtt cggcaagatc tcgggctggt ttaggtctat 240 

actgtccgac aagacttccc ggaacctgtt tttcttcctg tgcctgaacc tctctttcgc 300 

ttttgtggaa ctactctacg gcatctggag caactgctta ggcttgattt ccgactcttt 360 

tcacatgttt ttcgatagca ctgccatttt ggctggactg gcagcttctg ttatttcaaa 420 

atggagagat aatgatgctt tctcctatgg gtatgttaga gcggaagttc tggctggctt 480 

tgtcaatggc ctatttttga tcttcactgc tttttttatt ttctcagaag gagttgagag 540 

agcattagcc cctccagatg tccaccatga gagactgctt cttgtttcca ttcttgggtt 600 

tgtggtaaac ctaataggaa tatttgtttt caaacatgga ggtcatggac attctcatgg 660 

ctctggtggc cacggacaca gtcattccct ctttaatggt gctctagatc aggcacatgg 720 

ccatgtcgat cattgccata gccatgaagt gaaacatggt gctgcacata gccatgatca 780 

tgctcatgga catggacact ttcattctca tgatggcccg tccttaaaag aaacaacagg 840 

acccagcaga cagattttac aaggtgtatt tttacatatc ctagcagata cacttggaag 900 

tattggtgta attgcttctg ccatcatgat gcaaaatttt ggtctgatga tagcagatcc 960 1 

tatctgttca attcttatag ccattcttat agttgtaagt gttattcctc ttttaagaga 1020 

atctgttgga atattaatgc agagaactcc tcccctatba gaaaatagtc tgcctcagtg 1080 

ctatcagagg gtacagcagt tgcaaggagt ttacagttta caggaacagc* acttctggac 1140 

tttatgttct gacgtttatg ttgggacctt gaaattaata gtagcacctg atgctgatgc 1200 

taggtggatt ttaagccaaa cacataatat ttttactcag gctggagtga gacagctcta 1260 

cgtacagatt gactttgcag ccatgtagtg aatggaaaga aattatgcac cttttatgga 1320 

ccaaattttt ctggcccaac ccgatgagat gaagcatttc aaacttgagg agaagagaga 1380 

ctgcaggacg aggtggacag aaaaaccgtc agtaacaccg aggacatcta aaatctagtg 1440 

atgaacacga tgacaaccaa agtagcacaa gagaagaggc aagtcacagg gaggggtttg 1500 

ggaaccttat gtcacgctta agaactggga tgggaggttt taaacaaaaa aaaaaaaaaa 1560 

aaaaggggcg gccgccgact attgaaccct tcgcccgggg aataaattcc ggcccggtac 1620 

ctcgaggggg g 1631 

<210> 33 
<211> 2673 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4739923CB1 

' <400> 33 

ggacatttta aaagggccgg agattgcggg 
cagcatcggc ggcggcggct agtgccgcta 
tgggcaaacc ccgcttccag cagacgtcct 
tcatcgaccc tcgcacactc tttgtcactg 
tggaggacta taagcatggg accctgcgcc 
. cacagaaaat caagcaggct attctacatc 
ttagaatgcc aggttatatt ccttttggga 



cgtcagtggc catggcggat acagcgacta 60 

gcgcctcgag cgatgcacct cctttccaac 120 

tctatggccg cttcaggcac ttcttggata 180 

agagacgtct cagagaggct gtgcagctgc 240 

cgggggtcac caatgaacag ctctggagtg 300 

cggacaccaa tgagaagatc ttcatgccat 360 

cgccaattgt agtcggtctt ctcttgccca 420 
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accagacact ggcatccact gtcttctggc agtggctgaa ccagagccac aatgcctgtg 480 
tcaactatgc aaaccgcaat gcgaccaagc cttcacctgc atccaagttc atccagggat 540 
acctgggagc tgtcatcagc gccgtctcca ttgctgtggg ccttaatgtc ctggttcaga 600 
aagccaacaa gctcacccca gccacccgcc ttctcatcca gaggtttgtg ccgttccctg 660 
ctgtagccag tgccaatatc tgcaatgtgg tcctgatgcg gtacggggag ctggaggaag 720 
ggattgatgt cctggacagc gatggcaacc tcgtgggctc ctccaagatc gcagcccgac 780 
acgccctgct ggagacggcg ctgacgcgag tggtcctgcc catgcccatc ctggtgctac 840 
ccccgatcgt catgtccatg ctggagaaga cggctctcct gcaggcacgc ccccggctgc 900 
tcctccctgt gcaaagcctc gtgtgcctgg cagccttcgg cctggccctg ccgctggcca 960 
tcagcctctt cccgcaaatg tcagagattg aaacatccca attagagccg gagatagccc 1020 
aggccacgag cagccggaca gtggtgtaca acaaggggtt gtgagtgtgg tcagcggcct 1080 
ggggacggag cactgtgcag ccggggagct gaggggcagg gccgtagact cacggctgca 1140 
cctgcaggga gcagcacgcc aaccccagca gtcctgggcc ccctgggaga gtgctcaacc 1200 
tacagtggag ggagactgac ccattcacat tttaacatag gcaagaggag ttctaacaca 1260 
tttcgtacaa aaaaataatc aagtgcattt ctgggcctta tgtggggttg tcaaaactcc 1320 
actcagcaca attatgtgtg aagctgaaaa attgtagagt gcccatgggg tagaagtaga 1380 
atccttttat actttggttc cttttttatt tttatttttt atcagaatca aatctgagcc 1440 
ttagtttcag ctgaccagaa gtggcaggag gacaggtgga ggcgagccag attaggcctg 1500 
gagttgggct ggtttggtgg ccaggctggt aaatttagga tattacaatg gccagcccag 1560 
atggctccct gggctggcat ggggagggga gagaaggtgg tctgcacccc acaggatgaa 1620 
ctagccatga ctagggtcct caggcagtgg cccagggaat cagggagcac tggaggcctc 1680 
tgcaagattc tgtgggcagc tggctctgaa tgtagccagc ccacatccat tccagagctg 1740 
cagaaccagt ccctctgagt gaagtccagt gaccctggag ctagggtccc ccttcgtgga 1800 
gctctaactg attcagggcc cctgaagtga ccccagctcc aggcaggaaa ccccgagaag 1860, 
gaatggtgct tggcaggaac catggacctg cacttggcct cttcgggaag atcctccttc 1920 
caggccccag gctggatgct gggttctggg gctgagagtg gggctagact gggctggctg 1980* 
ccttctgcgg agcttctcca gccaccaagg ctgcctgccc catcccatcc ctttctaagc 2040 
aggaaggtct catgcctgag aatgcctagg ccagctcctt agacacctat cagagaagca 2100 
gcatcaacct aggagcagtg ggccctggct ctgtcactaa atggcagtga gatgtcagac 2160' 
aaattcattc ccttctctca gcctccactc cctgtctgaa accagaagac tggatcaagg 2220 
ggttcccact ggctcctcca gcaagacctc gtctttgctt gtcctgctca gatgctggtc 2280 
atcctgggca tgtccccagt gtggactctg gactgggaag ggggcaggcc cctttggacc 2340 
tgcagttggc ctcagcagaa ggccttgcct tgtgtatgtg actccatatc ccgggagcag 2400 
ttgacctttg ccaaacactt tacagttctg gaggaggagg taacatagat gcctgggcct 2460 
gatggtgggg ccatacccat gtgtcgcctc tcactctggc agcctcagag gccccttgct 2520 
gctggctccc atctccctcc catttgcaga ccaggaagga agagcaagct gtacaaaggg 2580 
aagcagagcc tggggtgggt gtgagcaggg tgacccctca tctgaaaggc ccaaaccagg 2640 
gggaagcacc agcctcagtg cagccccctc ctg 2673 

<210> 34 
<211> 3958 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> m±sc_feature 

<223> Incyte ID No: 55030459CB1 

<400> 34 

atggcccgcc agccggagga agaggagacg gccgtggccc gggcgcggcg gccgcccctc 60 
tggctgctct gcctggtcgc gtgctggctc ctgggcgccg gggccgaagc cgacttctcc 120 
atcctggacg aggcgcaagt gctggcgagc cagatgcgga ggctggcggc cgaggagctg 180 
ggggtcgtca ccatgcagcg gatattcaac tcctttgttt acactgagaa aatctcaaat 240 
ggagaaagtg aagtacagca gctagccaaa aaaatccgag agaagttcaa ccgttacttg 300 
gatgtggtca atcggaacaa gcaagttgta gaagcatcct atacggctca cctaacctct 360 
cccctaactg caattcaaga ctgctgtact atcccacctt ccatgatgga attcgatggg 420 
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aactttaata ccaatgtgtc tagaacaatt 
agccgggcct tcaatccagg acgagactta 
aaccctggaa ttaagtggca atatttcagt 
gcacacaagt tccggtgtaa gggcagctac 
acagtccggc cgcagtcaaa gcacatagta 
gacactcagc ttcagattgc caaggacgct 
catgacaaga tttctgtgtt aactgtggca 
tgctataaga ccttcttgtc tccagccacc 
gttagcagcg tgaagtcttc agacagtcct 
tttcagctga ttcgaagtac aaacaataac 
atcatttacc tgtcagctgg cattacatca 
actctccaag tcatcaatga agaaaatagc 
tatgccctca tgaacgatgg ggtgactggt 
gctgaacaga attcagggaa gtacggtgtg 
ggcagcatga tggtgctgaa tcagttgagc 
acaaaccttc ccaaccggat gattgatgaa 
atgggagatg gtttgataat gactgtgagt 
ggaattgtag gtgtggacgt gaatctggct 
gactctttgg cttcctatac ttttctcata 
tctcttacca ggccatattt attgtcagag 
gaaaatattc caaaatttga attagttcgg 
cagattatcg cagtccctgt gaactcatcc 
actggaaagg aagcctacaa tgttagctat 
attctgtgta ttgtggtgat acaaccagaa 
actgttccca gcagcaagct gctgtaccac 
tgcctccact tcaaacagct ggcaacccta 
agcttttcct ccccctatga gcacctcagc 
tacaccgcct atctcagcga caacacccgc 
gtcagaaatg aagtaatggc taccagccac 
atgagtagcc tgaacactta cattgtccgc 
agaatttatc ctggttccct catggacaaa 
ctccatgcag tagctaatcc agggttgatt 
ggagctggtt atgttgtgac aatcagtcac 
tctgggcaca ctgtggctgt gatgggcatt 
ctgatggacc tattacctgt ctgtaaccaa 
atggaggaca ggggttatct ggtggcgcac 
cctgtggagc agcagcacat cacccacaag 
caccccaact ttgtaaagaa aaacctgtgc 
ttttataaat tcaacaccag ccttgcgggg 
tgttccaaat acagactagc aaggatccca 
gaaacctgcg actctcttgc cttctgtgcc 
tgtcaccgaa tggaacaaaa tgaatgtgaa 
gagtgcactg gcaacctcac caatgcagag 
gagccggtga catacacagc tattgaccct 
aacagcaggt gcagtcagag gctggaaagt 
tggtgcatgg tggacagtga tggaaagact 
aaagaatgct tcggggggat tgtgggagcc 
ataggtgatg aggtgatcac attaaacatg 
ggagggatca tgggatgcat catggtcttg 
attcatcgcc ggagccatca gcatatgtct 
atgtccaacc tggagaatga cagagatgaa 
atcatcagca acactcggtt tatagctgcg 
agaaggcgcc gctactgggg tcgatcagga 
agcccacagg aggacagtga aaatcctcca 
gatgtgggaa accatgatga ggacttagac 
ctaagtcaca agttccacca ctaccggtca 



agttgtgatc gactttctac tactgttaat 480 
aattcagttc ttgcagacaa cctgaaatcc 540 
tcagaagaag gaattttcac tgttttccca 600 
gaacaccgca gtagacccat ctacgtctct 660 
gtgattctgg accacggggc ttcagtcaca 720 
gctcaggtca tcctcagcgc catcgatgaa 780 
gataccgtcc ggacttgctc actagaccag 840 
agtgagacaa aaaggaaaat gtccaccttt 900 
acccagcacg cagtgggatt ccaaaaggca 960 
acaaagttcc aagcaaatac agacatggtc 1020 
aaggactctt cggaagaaga taaaaaagcg 1080 
tttctaaaca actctgtaat gattctcacc 1140 
ttgaaagagc tggcttttct gagggatcta 1200 
ccagaccgga cggccttgcc tgtgattaag 1260 
aacctggaga ccacagtggg caggttctac 1320 
gccgtcttca gcctgccctt ctctgatgag 1380 
aaaccctgtt attttggaaa cctacttctg 1440 
tacattcttg aagacgtgac gtattaccaa 1500 
gacgacaaag gatatacact tatgcaccca 1560 
cccccacttc atactgacat catacattat 1620 
caaaatatcc taagcctccc tctgggcagc 1680 
ctgtcttggc acataaacaa gctgagagaa 1740 
gcctggaaga tggtacaaga cacttccttt 1800 
atacctgtga aacaactgaa gaacctcaac 1860 
cggctggatc tccttggcca gcccagtgct 1920', 
gaaagtccca ccatcatgct gtctgctggc 1980 
cagccagaga caaagcgcat ggtagagcac 2040 
ctcattgcta acccgggcct caaattctct 2100 
gtcacagatg aatggatgac acaaatggaa 2160 
cgttacatag caacacccaa tggcgtcctc 2220 
gcatttgatc ccactaggag acaatggtat 2280 
tctttgactg gtccttactt agatgttgga 2340 
acaattcatt catccagtac acagctgtct 2400 
gacttcacac tcagatactt ctacaaagtt 2460 
gatggtggca acaaaataag gtgcttcata 2520 
ccgactctca tcgaccccaa aggacatgca 2580 
gagcccctgg tagcaaatga tatcctcaac 2640 
aacagcttca gtgacagaac ggtccagagg 2700 
gatttgacga accttgtgca tggcagccac 2760 
ggaaccaacg cgtttgttgg cattgtcaac 2820 
tgcagcatgg tggaccgact ctgtctcaac 2880 
tgtccttgtg agtgccctct agaggtcaat 2940 
aaccgaaacc ccagctgcga ggtccaccag 3000 
ggcctgcaag atgctcttca ccagtgtgtc 3060 
ggggactgtt ttggggtgct ggattgtgaa 3120 
cacctggaca aaccctactg tgccccccag 3180 
aaaagtccct acgttgatga catgggagca 3240 
attaaaagcg cccctgtggg tcctgtggct 3300 
gtcctggcgg tgtatgccta ccgccaccag 3360 
cctcttgctg cccaagaaat gtcagtgcgt 3420 
agggacgacg acagccacga agacagaggc 3480 
gtcatcgaac gacatgcaca cagtccagaa 3540 
acagaaagtg atcatggtta cagcaccatg 3600 
tgcaacaatg accccttgtc agccggggtc 3660 
ctggataccc cccctcagac tgctgcccta 3720 
caccacccta cacttcatca tagccaccac 3780 



55/71 



WO 02/22684 



PCT/US01/28938 



ttacaggcgg ccgtcacggt acacactgtc gatgcagaat gctaacaatc tcctcacctc 3840 
cacgccaaga tgagatctgg gagctacaga atgttctgga aagaaaaaga accggcttaa 3900 
aacccacaag cagagacctc ccttgtgttt gtgctttgtg cagagttgtt tgagtcat 3958 

<210> 35 

<211> 2000 

<212> DNA 

<213> Homo ' sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6113039CB1 

<400> 35 

gctcaggaca atgaaattct tcagttacat tctggtttat cgccgatttc tcttcgtggt 60 
tttcactgtg ttggttttac tacctctgcc catcgtcctc cacaccaagg aagcagaatg 120 
tgcctacaca ctctttgtgg tcgccacatt ttggctcaca gaagcattgc ctctgtcggt 180 
aacagctttg ctacctagtt taatgttacc catgtttggg atcatgcctt ctaagaaggt 240 
ggcatctgct tatttcaagg attttcactt actgctaatt ggagttatct gtttagcaac 300 
atccatagaa aaatggaatt tgcacaagag aattgctctg aaaatggtga tgatggttgg 360 
tgtaaatcct gcatggctga cgctggggtt catgagcagc actgcctttt tgtctatgtg 420 
gctcagcaac acctcgacgg ctgccatggt gatgcccatt gcggaggctg tagtgcagca 480 
gatcatcaat gcagaagcag aggtcgaggc cactcagatg acttacttca acggatcaac 540 
caaccacgga ctagaaattg atgaaagtgt taatggacat gaaataaatg agaggaaaga 600 
gaaaacaaaa ccagttccag gatacaataa tgatacaggg aaaatttcaa gcaaggtgga 660 
gttggaaaag aactcaggca tgagaaccaa atatcgaaca aagaagggcc acgtgacacg 720 
taaacttacg tgtttgtgca ttgcctactc ttctaccatt ggtggactga caacaatcac 780 
tggtacctcc accaacttga tctttgcaga gtatttcaat acacgctatc ctgactgtcg 840 
ttgcctcaac tttggatcat ggtttacgtt ttccttccca gctgccctta teat tc tact 900 
cttatcctgg atctggcttc agtggctttt cctaggattc aattttaagg agatgttcaa 960 
atgtggcaaa accaaaacag tccaacaaaa agcttgtgct gaggtgatta agcaagaata 1020 
ccaaaagctt gggccaataa ggtatcaaga aattgtgacc ttggtcctct tcattataat 1080 
ggctctgeta tggtttagtc gagaccccgg atttgttcct ggttggtctg cacttttttc 1140 
agagtaccct ggttttgcta cagattcaac tgttgcttta cttatagggc tgetattett 1200 
tcttatccca gctaagacac tgactaaaac tacacctaca ggagaaattg ttgcttttga 1260 
ttactctcca ctgattactt ggaaagaatt ccagtcattc atgccctggg atatagecat 1320 
tcttgttggt ggagggtttg ccctggcaga tggttgtgag gagtctggat tatctaagtg 1380 
gataggaaat aaattatctc ctctgggttc attaccagca tggctaataa ttctgatatc 1440 
ttctttgatg gtgacatctt taactgaggt agecagcaat ccagctacca ttacactctt 1500 
tctcccaata ttatctccat tggecgaage cattcatgtg aaccctcttt atattctgat 1560 
accttctact ctgtgtactt catttgeatt cctcctacca gtagcaaatc cacccaatgc 1620 
tattgtcttt tcatatggtc atctgaaagt cattgacatg gttaaagctg gacttggtgt 1680 
caacattgtt ggtgttgctg tggttatgct tggcatatgt acttggattg tacccatgtt 1740 
tgacctctac acttaccctt cgtgggctcc tgctatgagt aatgagacca tgecataata 1800 
agcacaaaat ttctgactat ettgeggtaa tttctggaag acattaatga ttgactgtaa 1860 
aatgtggctc taaataacta atgacacaca tttaaatcag ttatggtgta getgetgeaa 1920 
ttcccgtgaa tacccgaaac ctgctgttat aactcagagt ccatatttgt tattgeagtg 1980 
caactaaaga gcatctatgt 2000 

<210> 36 

<211> 1997 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_£eature 
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<223> Incyte ID No: 7101781CB1 
<400> 36 

ctgctcgcca ctggccggcg cgctcccggc gcacggagca cactcgcgct cccggcgcac 60 
ggagcacact cgcgctccgg gactgaaacc tgagcagccg tagcagccga atttgggagc 120 
atatccttgt cactgcagcc agaaagccct tcgatcccca tcagagaggt cacatgagcc 180 
ccgaggtcac ctgcccgcgg aggggccacc tgcctcgctt ccacccgagg acctgggttg 240 
agcccgtggt ggcatcgtcc caggtggctg cctccctcta cgatgcgggg ctactcctcg 300 
tggtgaagge gtcctacgga accggaggct cctccaacca cagtgccagc ccatcgcccc 360 
ggggggctct agaggaccaa cagcagagag ccatctccaa tttctacatt atctacaacc 420 
ttgtggtggg cctgtccccc ctgctgtccg cctacgggct gggatggctc agcgaccgct 480 
accaccgaaa gatctccatc tgcatgtcgc tgctgggctt cctgctctcc cgcctcgggc 540 
tgctgctcaa ggtgctgctg gactggccag tggaggtgct gtacggggcg gcggcgctga 600 
acgggctatt cggcggcttc tccgccttct ggtccggggt catggcgctg ggatcgctgg 660 
gctcctccga gggccgccgc tctgtgcgcc tcatcctcat tgacctgatg ctgggcttgg 720 
cggggttctg cgggagcatg gcttccgggc atctcttcaa gcagatggct gggcactctg 780 
ggcagggcct gatactgacg gcctgcagcg tgagctgtgc ctcgtttgcc ctgctctaca 840 
gccttttggt gctaaaggtc cctgagtcgg tggccaaacc cagccaggag ctccccgccg 900 
tggataccgt gtctggcacg gttggcacat accgcactct ggatcctgat cagttggacc 960 
aacagtatgc agtggggcac cctccatctc ctggaaaagc aaaaccccat aaaaccacca 1020 
ttgccttgct ctttgtgggt gctatcatat atgacctggc ggtggtgggc acagtggacg 1080 
tgatccctct ttttgtgctg agggagcctc tcggttggaa ccaagtgcag gtgggctatg 1140 
gtatggctgc agggtacacc atcttcatca ccagcttcct gggtgtcctg gtcttctccc 1200 
gctgctttcg ggacaccacc atgatcatga ttgggatggt ctcctttggg tcaggagccc 12 6-0 
tcctcttggc ttttgtgaaa gagacataca tgttctatat tgctcgagcc gtcatgctgt 1320 
ttgctctcat ccccgtcaca accatccgat cagctatgtc caaactcata aagggctcct 1380 
cttatggaaa ggtgttcgtc atactgcagc tgtccttggc tctgaccggc gtggtgacat 1440 
ccaccttgta caacaagatc taccagctca ccatggacat gtttgtgggc tcctgctttg 1500 
ctctctcctc ctttctctcc ttcctggcca tcattccaat tagcatcgtg gcctataaac 1560 
aagtcccatt gtcaccatat ggagacatca tagagaaatg aagatgctta cctgcaggaa 1620 
ctgaaaacat cagccatggc caggccccca gaagacaaaa gaagggacca gggaactggt 1680 
gacctaagca acccactgct taagaaacct gcgttccagc cagagttggc ctcagaatga 1740 
cctgctctgg ctcagggatc cctggtggat ggggaaaagc actttcctgg tgatggaaaa 1800 
acgttctcag ctttaagaca cccccattag gcagacactg ggttctgtga cagcagagca 1860 
tgaccttaag ggttacaggg aggctgcaca cagcagtccc aggccctgtg aggggcttca 1920 
gactccagct acagcgagcc tgcccctttt cttcaaggga ctgtcttgag ggctccaaag 1980 
tatagctaac tagtcac 1997 

<210> 37 

<211> 3069 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7473036CB1 

<400> 37 

atgcaaccag ccagagggcc cctggcttca gaacctagga ctgtactggt tctgagattc 60 

tgtgcaagcc tcatggaaat gaagctgcca ggccaggaag ggtttgaagc ctccagtgct 120 

cctagaaata ttccttcagg ggagctggac agcaaccctg accctggcac cggccccagc 180 

cctgatggcc cctcagacac agagagcaag gaactgggag tacccaaaga ccctctgctc 240 

ttcattcagc tgaatgagct gctgggctgg ccccaggcgc tggagtggag agagacaggc 300 

aggtgggtac tgtttgagga gaagttggag gtggctgcag gccggtggag tgccccccac 360 

gtgcccaccc tggcactgcc cagcctccag aagctccgca gcctgctggc cgagggcctt 420 

gtactgctgg actgcccagc tcagagcctc ctggagctcg tgggctctac tcatccaaga 480 
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aaggcttctg acaatgagga agcccccctg 
aagctacctc caggagctga ggcagggact 
cagccactgg gagcctttgt tcgactgcgg 
gtgtccctcc caagcaggtt tttctgcctt 
taccatgaga tgggacgggc agcagctgtc 
gttcgtcggg ccagcaacct tcatgacctt 
gtgacagtgc ttcccccagg tcggtgggac 
ctgccatctc agcacaaaag gcttccctcg 
ccgcgcctga cctcggctga ggacaggcac 
ttgcagcgga ccggcagcga tttcttggac 
ctctacattt acctggccac tgtcactaat 
gccactgatg gtgcccaggg agtgctggaa 
gccttctgcc tgatggcagg ccagcccctc 
gtctttgagc gcctgctctt ctctttcagc 
cgcctatggg tgggcatctg ggtggctacc 
agtgtgctgg tgcgctactt cacccgcttc 
ctcatcttca tctacgatgc tgtgggcaaa 
cagaagcctg ggtcctctgc ctacgggtgc 
gagtctcaat ggataaggac aaggccaaaa 
ggcctgatca atgcatcctt gctgccgcca 
cgtggccctg gctgtcatac agtcccagac 
acttctttct tctttgctat ggccctcaag 
gtggtgcgca aagggctcag cgacttctcc 
cttgatgctt tcctgggcct agccacacca 
acactccctg ggcgtggctg gctggtgtca 
gtggcagctg ccctgcctgc cc tgctgctg 
acagcagtca tcctcaaccg catggaatac 
gacctcttct gtgtggctgt gctgatgcta 
gtctcagcca ctgtcatctc cctggctcac 
tgtgcccccg gggagcgccc caacttcctg 
gtggtgttca tccttacagg agcctccatc 
atgcctgtgc tctatggcat cttcctgtat 
ttcactaata gggtgaagct gttgttgatg 
ttgcggcatg tgcctctgac cagggtccac 
gggctgcttt ggataatcaa gtctacccct 
ggccttgtgg gggtcccfaaa ggccctggag 
ctggatgagc tgatgccaga ggaggagaga 
cac teat tea gtggaagtga cagtgaagat 
gaaatcaaca tttctgtgaa ttagctggag 
cagcatgagt tcacaggtgc ttactcagga 
ttccagatgc teagtegget tggggaagga 
ctcctgacct gagggtggag agtggcagga 
ggctggtgag ccagagggac tgatcaggee 
gagecacaa 

<210> 38 

<211> 2241 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7476943CB1 

<400> 38 

gccggcggcc cccgctcccg gatccccagc 



agggaacagt gtcagaaccc cctgagacag 540 
gtgctggcag gggagctggg cttcctggca 600 
aaccctgtgg tactggggtc ccttactgag 660 
ctcctgggcc ectgtatget gggaaagggc 720 
ctcctcagtg acccgcaatt ccagtggtca 780 
ctggcagccc tggatgeatt cctagaggag 840 
ccaacagccc ggattccccc geccaaatgt 900 
caacageggg agatcagagg tcccgccgtc 960 
cgccatgggc cacacgcaca cagcccggag 1020 
gccctgcatc tecagtgett ctcggccgta 1080 
gccatcactt ttgggggtct gctgggagat 1140 
agtttcctgg gcacagcagt ggctggagct 1200 
accattctga geagcaeggg gccagtgctg 1260 
agagattaca gectggacta cctgcccttc 1320 
ttttgcctgg tgctggtggc cacagaggcc 1380 
actgaggaag gtttctgtgc cctcatcagc 1440 
atgetgaact tgacccatac ctatcctatc 1500 
ctctgccaat acccaggccc aggaggaaat 1560 
gacagagacg acattgtaag catggactta 1620 
ectgagtgea cccggcaggg aggccaccct 1680 
attgecttet tctcccttct cctcttcctt 1740 
tgtgtaaaga ccagccgctt cttcccctct 1800 
tcagtcctgg ccatcctgct cggctgtggc 1860 
aagctcatgg tacccagaga gttcaagccc 1920 
ccttttggag ccaacccctg gtggtggagt 1980 
tctatcctca tcttcatgga ccaacagatc 2040 
agactgeaga agggagctgg cttccacctg 2100 
ctcacatcag cgcttggact gecttggtat 2160 
atggacagtc tteggagaga gagcagagee 2220 
ggtatcaggg aacagaggct gaeaggectg 2280 
ttcctggcac ctgtgctcaa gttcattcca 2340. 
atgggggtgg cagcgctcag cagcattcag 2400 
ccagcaaaac accagccaga cctgctactc 2460 
ctcttcacag ccatccagct tgcctgtctg 2520 
gcagccatca tcttccccct catgttgctg 2580 
agggtcttct caccacagga actcctctgg 2640 
agcatccctg agaaggggct ggagccagaa 2700 
tcagagctga tgtatcagee aaaggctcca 2760 
taggagtctg ggagtggaga ccccaggaaa 2820 
agtcaggaca tttttggcct ttggcttaac 2880 
ctgaagggca getgecaaga cctcagttac 2940 
agcaagcatg tttgctgtgc acttaggaaa 3000 
ccattcactc tctactcatt aaaaggtcct 3060 

3069 



gccctggcca agaagcttcc tcggctcccc 60 



58/71 



WO 02/22684 



PCT/US01/28938 



ctcttccctc tccctgacac ggttgtgcag 
ccattctact ttttgtgtct atgagtttga 
ccatgggcca ggtagcgttc tatgccaacc 
aaactcttcc aagatcataa cttggctgtt 
aaccatggca aaagtaaata gagctcggtc 
gatgattgtg gctggctgtt tccttgttac 
ctcaattttt tttgtggagt tccagacata 
gatccattcc attgtagatt gtgtgaccat 
taaccattta tcctgtcaag tgggaatcat 
catcctgagc tcatttgcca cgagtctgaa 
aggtcttgga tttgcacttt gttactctcc 
cagacggaaa gcccttgctt atggtatcgc 
cctggctcct gtggttcagc tccttattga 
tcttgggggc tttgtcttga atctctgtgt 
taaagaggac cacacaactc cagagcagaa 
taagcgggtg tctccctatt catctttgac 
ctgtttgcag caagagtaca gttttttact 
cgttctgttt atggcttatg gctgcagccc 
gagtgttgga gtgagtcatc agcaagctgc 
cattattggc aatatcacat ttggatggct 
gtatgtttgc tacctctttg ccgtgggaat 
gcttcaaagt ctccctctgc tfcgtgccttt 
ctatgtgact ttgatcccag tagtgaccac 
agcgcttggt gtggtatact tccttcacgc 
aggacggctg gtagatacca ccggcagcta 
aatgatattt agttctgtgt tgcttggctt 
ccagttgcag ttcattgcca aagaatctga 
agtggcttat tctgtggcaa gagaattaga 
agtgcctggc tacagcctca catgaccaaa 
gagaggtggg gccaccagat tcttcatgtt 
gccttccaag gaaattatta ttattgtttt 
agcacataat aaggaaagct ggactagccc 
aactgaactc gtatctttgg tcaatgggca 
gatataatta gctgtaatta gggataattt 
cagcagcttt gttccagtct caggcccttc 
taacgtggta gctttggtta cttggtctgg 
aagtgaaaag caaataaagt a 



agggcgcggt ggctcaggcc ctggcaacca 120 
ctaccctaag gacctcacat ggcgagtaac 180 
ttgaatgcca tcaggaagtc actggacagc 240 
ggagcaacct ggaaaagaag aaaaaagaaa 300 
tacctcccct ccagatggag gctggggctg 360 
catctgcaca cgggcagtca caagatgtat 420 
cttcactcag gattacgcac aaacggcatg 480 
gctctgtgct ccacttggga gtgttgtcag 540 
gctgggtggc ttgcttgcat ctactggact 600 
gcatctctac ctcactctgg gagttcttac 660 
agctattgcc atggttggca agtacttcag 720 
catgtcagga agtggcattg gcaccttcat 780 
acagttttcc tggcggggag ccttactcat 840 
atgtggtgcc ttgatgaggc caattactct 900 
ccatgtgtgt agaactcaga aagaagacat 960 
caaagaatgg gcacagactt gcctctgttg 1020 
catgtcagac tttgttgtgt tagccgtctc 1080 
tctctttgtg tacttggtgc cttatgcttt 1140 
ttttcttatg tccatacttg gagtgattga 1200 
gaccgacaga aggtgtctga agaattacca 1260 
ggatgggctc tgctatctct gcctcccaat 1320 
ctcttgtacc tttggctact ttgatggtgc 1380 
agagatagtg gggaccacct ctttgtcatc 1440 
agtgccatac ttggtgagcc cacccatcgc 1500 
cactgcagca ttcctcctct gtggattttc 1560 
tgctagactt ataaagagaa tgagaaaaac 1620 
tcctaagctg cagctatgga ccaatggatc 1680 
tcagaaacat ggggagcctg tggctacagc 1740 
ggccttgagc cccagaatct tcaggtttga 1800 
tctgaaactt tttattttgg cagaaggatt 1860 
gttaacatat taatatttat aagggaaaac 1920 
agagccttct catttgggat ttgtgctcat 1980 
tagctctgta agaaatgtaa ggacacagct 2040 
caaagcataa ccaaagcaga tgacactggg 2100 
atgttccctc ctcagaaaga aatggaacat 2160 
ttagagaagg aggccagtga gtggggggtg 2220 

2241 



<210> 39 

<211> 1593 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 8003355CB1 

<400> 39 

ccttggagct gttgtcccac ccctgtcact 
caggggccgc tgctcctcct gctgctgctg 
caggaggagc gcctgctcgc agacctgatg 
gaacgagact cggatgtggt caatgtcagc 
ctgaacgagc gagaggaagc cctcaccacc 
tatcgcctgc gccgggatcc gcgagactac 
accatggtgt ggcggccgga tatcgtgctg 
gccctctact gcaatgtgct cgtgtcccct 



gcagagagct gaggcaccat gcatgggggc 60 
gctgtctgcc tgggggccca gggccggaac 120 
caaaactacg accccaacct gcggcccgcg 180 
ctgaagctaa ccctcaccaa cctcatctcc 240 
aatgtctgga tagaggtgca gtggtgcgac ' 300 
gaaggcctgt gggtgctgag ggtgccgtcc 360 
gagaacaacg cggacggtgt cttcgaggtg 420 
gacggctgta tctactggct gccgcctgcc 480 



59/71 



WO 02/22684 



PCT/US01/28938 



atcttccgtt ccgcctgctc 
tcccttatct tccagtccca 
gaagatggcc agaccatcga 
gagtgggcca tccagcaccg 
gaagcaggcc accagaaggt 
gtcatcaaca tcatcgcccc 
cttcctgcca aggctggggg 
actgtcttcc tcttccttgt 
atcagcaagt acctgacctt 
gttgtgctca atgtctcctt 
ctgaggctct tgccccagct 
caggacaccc agtcccggct 
gaggtggccc tctgcctgcc 
ctggtggcgg cagcgctgga 
tgtggcagcc tgaagcaggc 
attgcctgtg cccggcacca 
gtgggccgag tgctggaccg 
acagctggca tcttcctcat 
gatccacgcc cctacctgcc 

<210> 40 
<211> 2121 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_£eature 
<223> Incyte ID No: 3116448CB1 

<400> 40 

gtacaaagga cctccagacc . agagccagcc agcagcaaaa agagcatgga gctgaggagt 60 
acagcagccc ccagagctga gggctacagc aacgtgggct tccagaatga agaaaacttt 120 
cttgagaacg agaacacatc aggaaacaac tcaataagaa gcagagctgt gcaaagcagg 180 
gagcacacaa acaccaaaca ggatgaagaa caggtcacag ttgagcagga ttctccaaga 240 
aacagagaac acatggagga tgatgatgag gagatgcaac aaaaagggtg tttggaaagg 300 
aggtatgaca cggtatgtgg tttctgtagg aaacacaaaa caactcttcg gcacatcatc 360 
tggggcattt tattagcagg ttatctggtt atggtgattt cggcctgtgt gctgaacttt 420 
cacagagccc ttcctctttt tgtgatcacc gtggctgcca tcttctttgt tgtctgggat 480 
cacctgatgg ccaaatacga acatcgaatt gatgagatgc tgtctcctgg cagaaggctt 540 
ctaaacagcc attggttctg gctgaagtgg gtgatctgga gctccctggt cctagcagtt 600 
attttctggt tggcctttga cactgccaaa ttgggtcaac agcagctggt gtccttcggt 660 
gggctcataa tgtacattgt cctgttattt ctattttcca agtacccaac cagagtttac 720 
tggagacctg tcttatgggg aatcgggcta cagtttcttc ttgggctctt gattctaagg 780 
actgaccctg gatttatagc ttttgattgg ttgggcagac aagttcagac ttttctggag 840 
tacacagatg ctggtgcttc atttggcttt ggtgagaaat acaaagacca cttctttgga 900 
tttaaggtcc tggcgatcgt ggttttcttc agcactgtga tgtccatgct gtactacctg 960 
ggactgatgc agtggattat tagaaaggtt ggatggatca tgctagttac tacgggatca 1020 
tctcctattg aatctgtagt tgcttctggc aatatatttg ttggacaaac ggagtctcca 1080 
ctgctggtcc gaccatattt accttacatc accaagtctg aactccacgc catcatgacc 1140 
gccgggttct ctaccattgc tggaagcgtg ctaggtgcat acatttcttt tggggttcca 1200 
tcctcccact tgttaacagc gtcagttatg tcagcacctg cgtcattggc tgctgctaaa 1260 
ctcfctttggc ctgagacaga aaaacctaaa ataaccctca agaatgccat gaaaatggaa 1320 
agtggtgatt cagggaatct tctagaagct gcaacacagg gagcatcctc ctccatctcc 1380 
ctggtggcca acatcgctgt gaatctgatt gccttcctgg ccctgctgtc ttttatgaat 1440 
tcagccctgt cctggtttgg aaacatgttt gactacccac agctgagttt tgagctaatc 1500 
tgctcctaca tcttcatgcc cttttccttc atgatgggag tggaatggca ggacagcttt 1560 

60/71 



tatctcagtc acctacttcc 
gacttacagc accaatgaga 
gtggattttc attgaccctg 
accagccaag atgctcctgg 
ggtgttctac ctgctcatcc 
ctgtgtgctc atctcctctg 
ccagaagtgt accgtcgcca 
ggccaagaag gtgcctgaaa 
cctcctggtg gtgaccatcc 
gcggtctcca cacacacact 
gctgaggatg cacgttcgcc 
acagaatggc tcctcgggat 
tcgcagtgaa ctcctcttcc 
gaagctagag aaaggcccgg 
tgccccagcc atccaggcct 
gcagagtcac tttgacaatg 
cgtctgcttc ctggccatgc 
ggcccactac aaccgggtgc 
ctcaccagac tga 



ccttcgactg gcagaactgc 540 
ttgatctgca gctgagtcag 600 
aggccttcac agagaatggg 660 
acccagcggc gccagcccag 720 
agcgcaagcc cctcttctac 780 
tcgccatcct catccacttc 840 
tcaacgtgct cctggcccag 900 
cctcccaggc ggtgccactc 960 
tcattgtcgt gaatgctgtg 1020 
ccatggcccg aggggtgttc 1080 
cgctggcccc ggcagctgtg 1140 
ggtcgatcac aactggggag 1200 
agcagtggca gcggcaaggg 1260 
agttagggct gagccagttc 1320 
gtgtggaagc ctgcaacctc 1380 
ggaatgagga gtggttcctg 1440 
tctcgctctt catctgtggc 1500 
cggccctgcc attccctgga 1560 

1593 



WO 02/22684 
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atggttgcca gactcatagg ttataagacc 
ctctcaaaat ggatccactt gaggaaagaa 
caatatatat caattcgttc tgagataatc 
atcgggtccc taggaatcgt gatcggcgga 
gatatcgcct cgggggcagt gagagctctg 
gcctgcatcg caggcatact ctccagcact 
gagaatgcct tcaactccac tttccctgga 
agtctgttga gcagcactgt tgccaagggt 
agtctgtatt ctttgaaggg ctgctgcaca 
gggatctcta atacattttg a 

<210> 41 
<211> 1225 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc„feature 

<223> Incyte ID No: 622868CB1 



ttcttcaatg aatttgtggc ttatgagcac 1620 
ggtggaccca aatttgtaaa cggtgtgcag 1680 
gccacttacg ctctctgtgg ttttgccaat 1740 
ctcacatcca tggctccttc cagaaagcgt 1800 
attgcgggga ccgtggcctg cttcatgaca 1860 
cctgtggaca tcaactgcca tcacgtttta 1920 
aacacaacca aggtgatagc ttgttgccaa 1980 
cctggtgaag tcatcccagg aggaaaccac 2040 
ttgttgaatc catcgacctt taactgcaat 2100 

2121 



<400> 41 

aattcattgg catgaagtct cggacatggg 
ttggaaccct gctggtggct ttgacaggat 
tgatcctctc cacagtgact gtccccttta 
ctttttggct tctctcagag ggacgatatg 
ccaagtggaa cagggcaagc tcctgtaaac 
gtcctgttag taatagcccc actgaagttc 
actggagcat tacgaaaagg acacttaccg 
gattctactc gttttccttg aattctgtta 
tcctcctggg tgtagtggaa attcccgcct 
tcgggaggag aacagtcctg gcctactctc 
ttatggtgat cccccagaaa cattatattt 
ttgccatcgg ggcagcattt ggcctcattt 
ttgtaagatc gctggctgtg ggaagcggca 
cgccgttctc tgtggacctc agcagcattt 
ctatggccct cctgagtgga gtgttaacac 
tagcaactac ttgggaggag gctgcaaaac 
aattacttct cacaactaat aatagtgggc 
attctggtct tggtgaataa atgtgccatg 
ccctaatgcc tttgtattag aggaatctta 
tttttaataa attttgtaag aaaattttaa 
aagatgaaaa ttctcagttt taaaa 



cgtctgtcca tttgcattcc ttttttgcag 60 
acttggtcag gacctggtgg ctttaccaga 120 
tcctgtgctg ttgggtgctc ccagagacac 180 
aagaagcaca aaaaatagtt gacatcatgg 240 
tgtcagaact tttatcactg gacctacaag 300 * 
agaagcacaa cctatcatat ctgttttata 360 
tttggctaat ctggttcact ggaagtttgg 420 
acttaggagg caatgaatac ttaaacctct 480 
acaccttcgt gtgcatcgcc acggacaagg 540 
ttttctgcag tgcactggcc tgtggtgtcg 600 
tgggtgtggt gacagctatg gttggaaaat 660 
atctttatac agctgagctg tatccaacca 720 
gcatggtgtg tcgcctggcc agcatcctgg 780 
ggatcttcat accacagttg tttgttggga 840 
taaagcttcc agaaaccctt gggaaacggc 900 
tggagtcaga gaatgaaagc aagtcaagca 960 
tggaaaaaac ggaagcgatt acccccaggg 1020 
cctgctgtct agcacctgaa atattattta 1080 
ttctcatctc ccatatgttg tttgtatgtc 1140 
agcaaatatg ttataaaaga aataaaaact 1200 

1225 



<210> 42 
<211> 2693 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7476494CB1 

<400> 42 

tcctttgaga cagctgctct gagagaatgc aataagcagg gagcagccag caattcctcc 60 

tagcagaggg cgactcgtgg gaggagttca gtttgccaag tattgtcatt tgttgagaga 120 

aggtgtgtgc tcaaggagga gttttaaccc ggaggatcat taactctttt agtcagctga 180 
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ggagctgcgg tggctcggcg agttggagtt 
gggatgaggt gtggaataac tttttcatgg 
tcaatgaagg agaagagaat gagatggaga 
aaagtctctg ccttgccgga tccatcttct 
ggagaccagc atggcacgta tgggcacatt 
ctgtgttgct gaggacaacg gtgagatgca 
actacttaga aggacagttc cagaaaattg 
agatacatca aaaatttgga tcaggcttga 
tgtgtgggcc taatactatc gatgttgaag 
aggttctaaa tccattttat atatttcaac 
actataagga atatgcfcttt gccatcataa 
tatatgatct cagagagcaa tctgtaaaac 
ttacggtctc tgtatgtggg agaaaagctg 
tgcctggaga tttattaatt ttgacaggga 
tgattgaagg cagctgtgtg gtggatgaag 
ccaaaactcc gttacccaag atggatagct 
attacaagcg gcatgtcctc ttctgtggaa 
ctgggaccgt gagagccgtg gtactgcaga 
tgagatccat tctctaccct aagccagtga 
tcctcctgtg ccttgtagga acagccacca 
tgcttagtgg ggaacctcca gaggaggtgg 
cggttcctcc ggctctacct gctgctctga 
tgaagaagag aggcatcttc tgcattagcc 
accttgtctg ctttgacaag acaggcacct 
tcgtgtcctg tgataggaat ggctttcagg 
tgccatgggg cccactgtgt gcagcgatgg 
ggaccatcca gggagaccct ctggacctca 
ctttttctgg ggacgatttc cacatcaagg 
cctgcagaac agccagccag gtcccagtgg 
tctcatcggc actgcaaaga atgacagtca 
cattcatgaa aggtgcacca gagagggtgg 
ctagttttgt tagcgaactt cagatttaca 
cctacaagaa gctggaaaat gaccatcacg 
cagacctgat atttctgggg ctgctgatct 
ctgtcttgga agagctcatc tcagcccgga 
ttcagactgc aataacagtg gccagaaaat 
ttctcattga ggcaaatgaa accaccgggt 
tagaagagaa gaaacacatt atgtatggga 
aagtctctga taaaggcaga gaaggaagtt 
atgttataag tcaacatttc agcagcctac 
ttgcaagaat gtctcctggg cagaagtcca 
aggttctttg ccagtgcagg tggcatgaac 

<210> 43 
<211> 3569 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 

<223> Incyte ID No: 7477260CB1 



catcctggaa gcgtctgcac gacaaggtca 240 
gacactttga gaagggccag cacgctctgc 300 
tatttggcta tcggactcaa ggctgccgga 360 
catttggaat cctccccttg gtgttttact 420 
gtgtcccatg ttccttgcaa gaagcagaca 480 
tcaaagtgca gaaaataaga tatgtttgga 540 
gttctttgga agactggctc agttctgcca 600 
caagagaaga acaggagatt aggaggttaa 660 
ttacaccaat ttggaaactg ctcatcaagg 720 
tcttcagtgt ctgtttgtgg tttagtgaag 780 
tcatgtccat aatttccata tctttgacag 840 
tccaccatct cgtcgagtca cataatagca 900 
gagttcaaga gctggaatca cgcgtcctgg 960 
acaaagtgct aatgccatgt gatgccgttc 1020 
gcatgctgac aggagaaagt attccagtca 1080 
ctgtgccctg gaaaacacag agtgaagcgg 1140 
cagaggttat ccaggccaag gcagcttgct 1200 
ctggattcaa cactgcaaag ggagaccttg 1260 
attttcagtt gtacagggat gccatcaggt 1320 
ttgggatgat ctatactctg tgtgtctatg 1380 
tgaggaaagc ccttgacgtc atcacaattg 1440 
ccacaggcat tatctatgcc cagaggaggc 1500 
cccagaggat caacgtatgt ggacagttaa 1560 
taacaaggga cggcttggac ctctggggag 1620 
aagttcacag ctttgcctca ggccaggctt 1680. 
ccagctgcca ctctctgatc cttcttgatg 1740 
aaatgtttga agccaccacc tgggaaatgg 1800 
gagtgccggc acatgccatg gtagttaagc 1860 
aaggaattgc aatcctgcat cagttcccat 1920 
ttgtccaaga gatgggaggt gaccgactgg 1980 
ccagcttttg ccaacctgag acagtaccca 2040 
cgacacaggg cttccgagtc atagcactgg 2100 
ctactacctt gacgagggag acggtagaat 2160 
tggagaatcg attgaaggaa gagacaaaac 2220 
taaggactgt aatgatcaca ggtgacaatc 2280 . 
ctggaatggt ttctgaaagc cagaaagtca 2340 
cctcatcagc atctatatct tggacgttag 2400 
atcaggacaa ttacattaac atcagggatg 2460 
accattttgc cctaactgga aaatcctttc 2520 
tgccaaagat attgatcaat gggaccatct 2580 
gtctggtgga agaatttcag aaactggagt 2640 
tgcatggagg cataacagtc agg 2693 



<400> 43 

agaccagtgt tggaggatgg cttgcttggg 
gtaagctgaa gttgggtcag agtgcttctc 
tgccctggct tcgaccattc ccccatcatt 



gcccggtggg aagaagaacc ccctcgggtg 60 
acttctctct tcagttctgg ttcttgctac 120 
cttcactgcc agctgcaaga ccctggatct 180 
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gaatgcagac taaatctttt catctctttt 
tcatggttac aggaggccag catcacccag 
atacttttcc cttgtgcttg cctcctactc 
ttccatggga cactcagcag acctgctgcc 
agctccagga catcctgggg gaggaagatg 
tggatactct gcagcatgac ggagaccaga 
agtttgaaga aaaggtagag gaaggcggcg 
tatccctgca cagcctcttc gagctccgta 
atttggacag tggctcctta ccacagatca 
atggtctcct gcggccagag ctccgggaga 
gccaccaaac caagaagccc atccaccgct 
ccacaaatcg cagtcctgcc cggagccctg 
aagacctgcg gatgcggcag agtgcaaatt 
gcatgaatga catttctctc accccaaaca 
agatccccaa ggactcagaa gcgtccaacg 
agccattcat cgcgttcgtg cgcctcatcc 
tgcctgtccc caccagattt ctgtttatac 
acaatgaaat tggccgtgcc attgcaaccc 
cctacaaagc ccgcaatcgg gaagatctga 
tcatcgtcct tcctcctgga gaatgggacc 
tgccctctgc tgacaagagg aaatctctgt 
gctctgtggg aggaggcggc ggagctcctg 
gcagtggcgg cggggetggc agtggcgggg 
agatgccagc catgcatgaa atcggggagg 
gcctgtgtct ggatatcaag aggaagttgc 
tccacattca gtccatctct gccatcctat 
tcacctttgg tgggcttctg ggggatgcca 
tcctgggcac tgccatggct ggctccttgt 
ttctcagcag cacggggccc atcctcatct 
gcaatggcct ggactacatg gagttccgcc 
gccttatcct agtggccaca gatgccagct 
aggagggctt ctccaccctt at cage t tea 
teggtgeett caagtactac cctatcaata 
acaagtgega gtgtgtcgcc cctgacacag 
cagccccatt ggcaccagac accaacgctt 
tggactggtc cctgctgagc aagaaggagt 
attcctgeaa gtttatccca gacctggcgc 
actccatgac cctgaccctg aagaagttea 
gggccctggt ggctgacttt tccattgttt 
cctgttttgg cctagaaact cccaagctgc 
ctgaccgagg ctggttcgtg gccccctttg 
gcatcctgcc cgccctgctg gtgaccatcc 
tcattgtcaa ceggaaggag aacaaactga 
tctgggtggg catcctcatg gctttgtgct 
ccacggtcat ctccatcgcc cacatcgaca 
ctggggagca gccccagttt ctgggagtca 
tcatcctgac gggaatctct gtcttcctgg 
tgctgtacgg agtcttcctc tacatgggcg 
aacgetgeaa gctcttcctg atgccagcca 
acgtgccgct gcgccggatc cacctcttca 
tctggatcct caaatccacg gtggctgcca 
teategtteg aaggcttctg gatttcatct 
acatcctccc agagaaggaa aaaaaggaga 
cccacgagga ctgtgatgag gaggaaaaag 
cttccttcag tgacacagaa cttgaccgaa 
cacaggtgaa gatagagatg gagtcagact 



atcttaaaga gtatctcgcc cacctctgat 240 
gagctggact tagttttaca gaattagaaa 300 
catttctgtt ggccttgtgg tcctcctgcc 360 
cctcttttgc agggtcccca getgetgage 420 
aggctcccaa ccccaccctc tttacagaga 480 
tggagtggaa ggagtcagee aggtggataa 540 
aacgetggag caagccccac gtgtccacac 600 
cctgcctgca gaeggggacg gtgctgctgg 660 
tagatgatgt cattgagaag cagattgagg 720 
gggtcagtta eg tec tec tg aggaggcacc 780 
ccttagctga cattgggaag tcagtctcca 840 
gtgctggccc gagtctacac cactccacgg 900 
aeggaegtet gtgtcatgcc cagagcagaa 960 
cagaccagcg gaaaaacaaa ttcatgaaga 1020 
tgctcgtggg cgaggtggac ttcctagacc 1080 
agteggecat gctgggagga gtgaccgagg 1140 
tactgggacc ttctgggaga gcaaaatcct 1200 
tcatggtaga tgatctcttc agtgacgtgg 1260 
tegcaggaat tgatgaattt ctggatgagg 1320 
caaatatccg gattgagece cccaagaagg 1380 
tctccctagc agagctgggc cagatgaatg 1440 
gaggaggcaa tggaggtggt ggtggtggtg 1500 
ccggcggaac aagcagcggg gatgatggag 1560 
aacttatctg gacaggaagg ttcttcggtg 1620 
cctggttccc aagtgacttc tatgatggct 1680: 
tcatctacct eggctgtate accaacgcga 1740- 
ccgacaatta tcagggagtg atggagagct 1800 
tctgcctctt ctegggacag cctctcatca 1860 
ttgagaagct cctcttcgac ttcagcaaag 1920 
tctggattgg cctacactca gctgtccagt 1980 
ttatcatcaa atatatcacc cgcttcaccg 2040 
tcttcatcta cgatgccatc aagaagatga 2100 
tggacttcaa gccaaacttc atcactacct 2160 
gtgacctgaa tacaacegtg ttcaatgett 2220 
ctctgtacaa cctccttaac ctcacagcgt 2280 
gtctgagcta cggtgggcgc ctgcttggga 2340 
tcatgtcctt catccttttc tttgggacat 2400 
aattcagccg ctattttcct accaaggtcc 2460 
tctccatcct gatgttctgt ggaatcgatg 2520 
atgtgcccag tgtcatcaag ccaacgcggc 2580 
ggaagaaccc gtggtgggta tacccagcaa 2640 
tgatcttcat ggaccagcag atcactgccg 2700 
agaaggctgc cggctaccat ctggacctgt 2760 
cctttatggg gctcccctgg tacgtggctg 2820 
gectcaagat ggagacagag accagtgccc 2880 
gggaacagag agtaacegge ateategtet 2940 
ctcccatcct aaagtgtatc cccctgccgg 3000 
tggcctccct gaatggcatc cagttctggg 3060 
agcaccagcc ggaccatgcc ttcctgcggc 3120 
ccctggtgca gatcctctgc ctggcggtgc 3180 
tcatcttccc ggtcatgatc ctgggcctca 3240 
tttcccagca cgacctggcc tggattgaca 3300 
cagacaagaa gaggaagaga aaaaaagggg 3360 
atcttccagt tggagttact cactctgatt 3420 
gctactcacg gaacccagtg ttcatggtgc 3480 
atgacttcac agacatggat aaataccgaa 3540 
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gagaaactga cagtgagacc accctctag 

<210> 44 

<211> 3920 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1963058CB1 

<400> 44 

cggacgcggc ggacgtgggt gagggcgcgg 
cgcgtggtgg gggtccccgg cgcctgcccc 
ccctccgcgg agcccggggg tggggggcgc 
ccagcggccg gagcggcgcc gtccccgcgg 
gccgtgggcc acttcctcaa cgacctgtgc 
tacctgcact cggtgcgcgc ctacagctcc 
caggtggccg acgggctgtg cacaccgctc 
tgctgcgccc gctacggccc gcgcaaggcc 
ctgtccttcc ccttcatctt cagcccctgc 
gctgccctcc tctactacgg cccgttcatc 
cagatctccc acctcagcct catcccggag 
ctcacggcac tcaggtatgc gttcaccgtg 
tggctcctgc tgcacctgca gggctcgtcg 
agcgaccagc tggggggcca ggacgtgccc 
ggtgtcggcg ccgtgttctc actgctattc 
catgcggagg agccaggcga gcacaccccc 
ctgctctgga agcactggct ccgggagccg 
accaccaggc tcatcgtgaa cctgtcccag 
ctccacctgc ccaagaagtt catcgcgacc 
ttgtcctcct tcctcatgaa gcccatcaac 
tcaggcctcc tggtgatcct ggcctttgcc 
gtggccgtgt acgcagcggc tgtgctgctg 
tcgctggcca tgacggccga cctcatcggt 
ggctccatga gcttcttgga taaggtggcc 
ctgcaccctt gcccctcaga gctctgctgc 
atggtggctg tgacgggcgg cgtgggcgtg 
ctgtggccga cccgcctgcg acgctgggac 
tcctgcacct gtgcaaggga actgtgggga 
aaaagccccc actgcccctc actcttctct 
ccgggggtgg ggtcgggtga gggcagcagg 
tgggttttga gggtgtccca ttctcaactc 
ggtgcccctc tcggcaggga acaggaagta 
accctgagct cagtccagtt cacccctcac 
gggccccctc aggacggctg gagcctggag 
ggaccccacc gtggtgggca gcagggctgc 
aaataaagag atgacggcag cctggctcct 
tagcctgggc atctcagccc tgccctggtt 
ctgtcccgag cctctggtcc cctgggacta 
cagtgggtgc atcctggtag catgccttta 
ccgctggcag gtggaggggc agtatggttc 
gccgggcggg cgctggggtg ccgacaggga 
ctccaccccg taccggttcc tgttgagcac 
ccacctcctc gccgttggcc agcaaggggc 
gcctaggtgg ccttgtcccc ctggcctccc 



3569 



ccgtaagaga gcgggacgcg gggtgcccgg 60 
cacggcaccc aagaaggcct ggccagggta 120 
ggggccggcg ccgcgatggg cccgggaccc 180 
ccgctgtccc tggtggcgcg gctgagctac 240 
gcgtccatgt ggttcaccta cctgctgctc 300 
cgcggcgcgg ggctgctgct gctgctgggc 360 
gtgggctacg aggccgaccg cgccgccagc 420 
tggcacctgg tcggcaccgt ctgcgtcctg 480 
ctgggctgtg gggcggccac gcccgagtgg 540 
gtgatcttcc agtttggctg ggcctccaca 600 
ctcgtcacca acgaccatga gaaggtggag 660 
gtggccaaca tcaccgtcta cggcgccgcc 720 
cgggtggagc ccacccaaga catcagcatc 780 
gtgttccgga acctgtccct gctggtggtg 840 
cacctgggca cccgggagag gcgccggccg 900 
ctgttggccc ctgccacggc ccagcccctg 960 
gctttctacc aggtgggcat actgtacatg 1020 
acctacatgg ccatgtacct cacctactcg 1080 
attcccctgg tgatgtacct cagcggcttc 1140 
aagtgcattg ggaggaacat gacctacttc 1200 
gcctgggtgg cgctggcgga gggactgggt 1260 
ggtgctggct gbgccaccat cctcgtcacc 1320 
ccccacacga acagcggagc gttcgtgtac 1380 
aatgggctgg cagtcatggc catccagagc 1440 
agggcctgcg tgagctttta ccactgggcg 1500 
gccgctgccc tgtgtctctg tagcctcctg 1560 
cgtgatgccc ggccctgact cctgacagcc 1620 
cgcacgagga tgccccccag ggccttgggg 1680 
ggacccccac cctccatcct cacccagctc 1740 
gatgcccgcc agggacttgc aaggaccccc 1800 
taatccatcc cagccctctg gaggatttgg 1860 
ggaatcccag aagggtctgg gggaacccta 1920 
ctccagcctg ggggtctcca gacactgcca 1980 
gagacagcca cggggtggtg ggctgggcct 2040 
ccggcaggct tggtggactc tgctggcagc 2100 
gtctgcctgc gggggggctc tgggcagggg 2160 
gtgggcggcc agcgagccca gtgtctgcct 2220 
ggttagtgcc ccctcatctg ggtgcagaga 2280 
tcggggagtg ggtgtgaggg aaggcgggga 2340 
caggacccac tcccggtagt tctgggtggt 2400 
gggcacgtag tctgatgccc tccacagtgg 2460 
tgtaggtggg actcgggtca ccatgtgccc 2520 
tcctggatcg ccccgggcag tttcaccctg 2580 
aaggacccac cctgcaccta gcctcaccgt 2640 
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attccttgcc ccggattggc ctgtctttcc acagcgcgct cccccaccgg gtgctggggg 2700 

cctggtactg ggcagggacg atggggtcat gccaggcggt ctcccgcagg tgctgggtgt 2760 

aggctgcggt ggggcggggg ctggcggtca ttcctgtccc cctctggcag gcccgctgcc 2820 

cagggcgggg gggggggcac tcacccgatg gcatgctgca ctcacggtgt tggtagcagc 2880 

tgtgccagcg gttgtaggcc tcgcggaagg ggctggccgg ggcccgtggc aggttgtacc 2940 

aggcttccca ggcgtccgag ttggtcaggc ctgtgtacca cagctggccg gctgcgtccc 3000 

gtcccatggg cgtgtacttc cagcgtgtgg cctgcctgat ggctggtggc cagcgggaac 3060 

cctccaggga caggtagtca tcgctgaggg tgggcagggg gcagagactg agcccatgtc 3120 

tacagcgagt gctttgaccc ctttgcgatg tctgccaggg tggatgatgt agaggcctgg 3180 

cccacggcgt ggggtctccc tccctcgcca cttggagtct gtccttcagc cctgtacccc 3240 

tcaccccaga gtgggtgctt gaggagagag gctgactccc cctctcccca cacatcgcac 3300 

cccaagcacc caagtcagca ctaaaccttt ctgttctcag cttttcttgc ctggagaaga 3360 

gggaggggag aggacaaggg ccctggctac tcctggattc ctacagtcct tgtccagcct 3420 

ccaagaccca caagtccctt cctctgggaa gcccccctgg cctggaggtg caccaggaag 3480 

aagtggtctg gggctggcac taagccatgg cccagggaag actgggggac ccactaggcc 3540 

aggtgtgtgg ctcacgcttg taaacccagc actttgggag gctgaggcag gtggatcact 3600 

tgaggtcagg agttcgagac cagcctggcc agcatggtaa aaccccatct ctactaaaaa 3660 

tacgaaaatt aagccaggca tggtgtgggg gcggggggca cctgtaatcc cagctactca 3720 

ggaggctgag gcaggagaat cgcttgaacc caggaagtgg agtttgcagt aagctgagat 3780 

cgtgcccttt gactccagcc ctgggaaaag agtgagactc cgtcctcaaa aaccaaaggg 3840 

ccaggagact caaagaatgt cttatgcttt gaaccttgct ccttggaata atgtcccagg 3900 

gaagtcatcc cagaaaacaa 3920 

<210> 45 
<211> 1361 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2395967CB1 



<400> 45 

ctggaagcat gtcggagttt tggttaattt ctgcccctgg cgataaggaa aatttgcaag 60 
ctctggagag gatgaatact gtaacctcca agtccaacct gtcttataat accaaattcg 120 
ctattcctga cttcaaggtg gggaccttgg attccctggt tggcctctct gatgagttgg 180 
ggaaactcga cacctttgct gaaagcctca taaggagaat ggctcagagc gtggtggaag 240 
tcatggagga ctcaaagggg aaggtccagg agcacctcct ggcaaacgga gttgacttaa 300 
catcctttgt gacccacttt gaatgggaca tggccaaata tcctgtcaag cagccgctcg 360 
tgagtgtggt ggacacaata gccaagcaac tggcgcagat cgagatggac ctgaagtccc 420 
gaacggccgc ctacaacact ctgaagacaa acctggagaa cctggaaaag aaatccatgg 480 
ggaacctctt cacccggaca ctgagtgata ttgtgagcaa agaggacttc gtgctggatt 540 
ctgaatatct cgtcacactt ctggtcatcg tccccaaacc aaactactca caatggcaaa 600 
aaacctacga atctctctca gacatggtag tccctcgatc aaccaaactc attactgagg 660 
acaaggaagg gggccttttc actgtgactc tgtttcgaaa agtgattgaa gatttcaaaa 720 
ccaaggccaa agaaaacaag ttcactgttc gtgaatttta ctatgatgag aaggaaattg 780 
aaagggaaag ggaggagatg gccagattgc tgtctgataa gaagcaacag tatggccccc 840 
tgctgcgctg gctcaaggtg aacttcagtg aagccttcat tgcctggatc cacatcaagg 900 
ccctgagagt gtttgtggag tccgtgctca ggtatggact accagtgaac ttccaggcag 960 
tgctcctgca gccgcataag aagtcatcca ccaagcgttt aagagaggtt ctaaactctg 1020 
tcttccgaca tctggatgaa gtagccgcta caagtatact ggatgcatct gtggagatcc 1080 
cgggactgca actcaataac caagactatt ttccttatgt ctacttccat attgacctta 1140 
gtcttcttga ctagaaaggc cagctggcac ctctgtctca tgttcgtgca gattattaca 1200 
gacacctctt tcctttagcc agagaatggt tcaaatgtct tacagaacta agatcttttt 1260 
cagagaaatt gctcacaaaa gttagtgaca gttgtattta tttttttaag ttacaataaa 1320 
atgctctcaa gtcctttgaa tgttccaaca aattcaaaaa a 1361 
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<210> 46 

<211> 1867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3586648CB1 

<400> 46 

cagaattagc cggtatagga atgaacgagc atgaagattt gaaattgctc cgattggaag 60 

gaagcccagg ttaggtttgg gcacctccaa acgcacccgt tttaaagcca cctggactga 120 
ggcgtcgagc ttt cage tec accaaacgct cacctggcct ggcagegage ggeggaagag 180 

cccgggagcc cctcacagag cgcaccgagc egggeggaga gctgagccgc aggcacccgc 240 

gtctccagga tgataggega cattgeaaca aatctctaca cccagcagct cagggggctc 300 

caagcagagc agcaagttcg aggatceggg cgtggagccg agtgaggccg cagcccagcg 360 

ggcctcgggc gaaaaatctt ggaaaatgta taccagtcat gaagatattg ggtatgattt 420 

tgaagatggc cccaaagaca aaaagacact gaagccccac ccaaacattg atggcggatg 480 

ggcttggatg atggtgctct cctctttctt tgtgcacatc ctcatcatgg gctcccagat 540 

ggccctgggt gtcctcaacg tggaatggct ggaagaattc caccagagcc gcggcctgac 600 

cgcctgggtc agctccctca gcatgggcat caccttgata gtgggccctt teateggett 660 

gttcattaac acctgtgggt gccgccagac tgegatcatt ggagggctcg tcaactccct 720 

gggctgggtg ttgagtgcct atgctgcaaa cgtgcattat ctcttcatta cttttggagt 780 

cgcagctggc ctgggcagcg ggatggccta cctgccagcg gtggtcatgg tgggcaggta 840 

tttccagaag agacgcgccc tcgcccaggg cctcagcacc aeggggaccg gatteggtae 900 

gttcctaatg actgtgctgc tgaagtacct gtgegcagag tacggctgga ggaatgecat 960 

gttgatccaa ggtgccgttt ccctaaacct gtgtgtttgt ggggegctea tgaggcccct 1020 

ctctcctggt aaaaacccaa acgacccagg agagaaagat gtgcgtggcc tgccagcgca 1080 

ctccacagaa tctgtgaagt caactggaca gcagggaaga acagaagaga aggatggtgg 1140 

getegggaac gaggagaccc tctgcgacct gcaagcccag gagtgccccg ateaggcegg 1200 

gcacaggaag aacatgtgtg ccctccggat tctgaagact gtcagctggc tcaccatgag 1260 

agtcaggaag ggcttcgagg actggtattc gggctacttt gggacagect ctctatttac 1320 

aaatcgaatg tttgtagcct ttattttctg ggctttgttt gcatacagca gctttgtcat 1380 

ccccttcatt cacctcccag aaatcgtcaa tttgtataac ttateggage aaaacgacgt 1440 

tttccctctg aegtcaatta tagcaatagt tcacatcttt ggaaaagtga tcctgggcgt 1500 

catagccgac ttgecttgea ttagtgtttg gaatgtcttc ctgttggcca acttcaccct 1560 

tgtcctcagt atttttattc tgccgttgat gcacacgtac gctggcctgg eggtcatctg 1620 

tgegctgata gggttttcca gtggttattt ctccctaatg cccgtagtga ctgaagactt 1680 

i ggttggcatt gaacacctgg ccaatgccta cggcatcatc atctgtgcta atggcatctc 1740 

tgcattgctg ggaccacctt ttgcaggtaa actctctgag gttttaagag ctcagagtgc 1800 

atgtacatat ggtgcgttat gttataaagt cccagattaa gaaacaaaaa aaaaaaaaaa 1860 

agategg 1867 

<210> 47 

<211> 2211 

<212> DNA, 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7473396CB1 

<400> 47 

atgeagaata ttaccaaaga atttggaaca ttcaaggcaa atgacaacat caatttacaa 60 
gtaaaggcag gagagattca tgcgttgctt ggagaaaacg gtgctggcaa atctacattg 120 
atgaacgtgc tttceggatt attagagecg acatcaggga aaattttgat gcgtgggaaa 180 
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gaagtacaga tcacaagccc gacaaaagcc 
cactttatgc ttgttgatgc ctttactgta 
agtcgtgcag ggatgcttga ccataaaaaa 
caatatggat tatcagtcaa cccggatgct 
caacgggtag aaattttaaa aacactttac 
ccgacagctg tattgacccc tcaggaaatt 
gtcaaagaag gcaagtcaat cattttgatt 
gctgaccgtt gtacagttat ccgccgtgga 
gttacctcac agcaattagc tgatatgatg 
aaaaaagaag cgaagcctca agaagtcgtt 
aatcgtggat tagaagccgt gaaaaacctg 
ggtatcgctg gaatcgatgg aaacgggcag 
cgaaaggcag aaagcggaca tatcaagcta 
cgaaagatca ctgaacatgg tgtaggacat 
gtcctagata tgacattgtc tgaaaacatt 
agtaaaaacg gtatgctgaa ttattcagtg 
gaatatgatg ttcgaacaac gaatgaactt 
cagcaaaaag caatcatcgc tcggatagtc 
aatccaactc gtgggctgga tgtaggagaa 
ggaaagagta cattggtcaa tagtatctta 
aattctgcta agccaggtaa attcaagaca 
atcgatatcg atcaaagccc aatcggccgg 
agtgtatttg atgatatccg tgggttattt 
tataagaaag ggcgttttag tttcaacgta 
gatggaatta ttaagatcga aatgcacttt 
tgtcatggca aacgatataa ctctgaaaca 
gctgatattt tggaaatgac agtagaagat 
attcatcgca aactgcaaac gattgttgat 
ccagcaacga cattgtccgg tggtgaggca 
aaaatctcta atggaaagaa tttctatata 
gatgacatcg cccgcttgtt gcatgtatta 
ttagtgattg aacacaatct agatgtaatc 
ccagaaggtg gagaaggtgg aggaacgatc 
aacgtaaaag aaagttatac aggtcactat 

<210> 48 

<211> 1446 ' * 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID Nor 7476283CB1 



aatcaattag ggattgggat ggtccatcag 240 
acagaaaaca tcgtgttggg aagcgaacct 300 
gcgcgaaaag agatccaaaa agtttctgaa 360 
tatgttcgtg atatttcagt tgggatggaa 420 
cgaggagcag atgtactgat ttttgatgag 480 
gatgaattaa tcgtgatcat gaaggaatta 540 
acgcataagt tagatgaaat caaagcagta 600 
aaaggaatcg gtacagtcaa cgttaaagac 660 
gtcggaagag cggtttcatt caaaacgatg 720 
ttgtctattg aaaatctagt ggtaaaagaa 780 
aacttagagg ttcgtgctgg cgaagtactt 840 
tcggagttga tccaagcttt gactggtttg 900 
aaaggggaag acatcaccaa taaaaaacct 960 
gtgccagaag accgtcataa atacgggttg 1020 
gccctgcaaa cgtatcatca aaaaccttac 1080 
ataaatgaac atgccagaga attgatcgaa 1140 
gttcctgcaa aagctttatc aggcggaaat 1200 
gaccgagatc ctgatctgtt gatcgttgca 1260 
tttgtagcag tcacaggtgt gtctggttct 1320 
aagaaatcgt tagcgcaaaa attaaataag 1380 
atttccggct acgaaagtat cgaaaagatc 1440 
acgccgagaa gtaatccagc gacttataca 1500 
gctcaaacga acgaggcaaa aatgcggggt 1560 
aaaggcggtc gttgtgaagc ttgtcgcggg 1620 
ttgcctgatg tctatgttcc ttgtgaagta 1680 
ttagaagtgc attacaaagg aaaaagcatt 1740 
gctgtagaat tcttcaagca cattccaaag 1800 
gttggcttag gttatgtgac tatggggcaa 1860 
caacggatga aacttgccag tgaattgcac 1920 
ctagatgaac caacgacagg acttcatagc 1980 
caaagattag tagatgctgg taacacagtt 2040 
aaaacagcag attatatcat tgatttagga 2100 
cttacgactg gaacaccaga agaaatcatt 2160 
ttgaaaaaaa taatggtata a 2211 



<400> 48 

tggctgggag aattgagcta 
aaagcttttc tcttctcccc 
ttcttgttac tgaccctgca 
gaggatttaa aggtgaacaa 
acacaaattc tgaattcatt 
gtgaggccca cagtaattga 
attaatatgg aatatacaat 
aaattcaata gtaccatgaa 
attcctgaca ctttcttcag 
aatcgtctgc ttcgaatttg 
aatgcagaat gttatcttca 



gtgcagcaca cgtaaaaaag 
ttttcttctg cggagtcaaa 
tttgggaaac tgtgttgata 
aacctgggtc ttggccccaa 
gcttcaaggc tatgacaata 
aactgatgtt tatgtaaaca 
agatataatt tttgcccaaa 
agtgcttatg cttaacagta 
aaactcaaga aaatctgatg 
gaatgatgga cgagttctgt 
gcttcataac tttcccatgg 



cgattccgat gggtcctttg 60 
gtagaggggt gaggttggtc 120 
aggcagatga tgaagatgat 180 
aaattcatga aggagatatc 240 
aacttcgtcc agatatagga 300 
gcattggacc agfctgatcca 360 
cctggtbtga cagtcgttta 420 
atatggttgg aaaaatttgg 480 
ctcactggat aacaactcct 540 
atactctaag attgacaatt 600 
atgaacattc ctgtccactg 660 
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gaattttcaa gcgatggata 
gtagaagtgg ctgatcctaa 
aactcaactg aaatcactca 
gacctgagca gaagaatggg 
gttgttcttt cttgggtgtc 
ttgggtatca ctacagttct 
cctaaggttt cttatgtgac 
tttgcagcct tgatggaata 
actgctacta aagacagaaa 
ggatccactc tgattccaat 
cagtgtttgg agggcaaaga 
ggatcttgga gggaaggaag 
atatttttcc caaccgcttt 
ttataa 

<210> 49 
<211> 1332 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 7477105CB1 

<400> 49 

ttcggctcga gggaccccag gccgggccgg gccgagaggc tgccatgggc tccgtgggga 60 
gccagcgcct tgaggagccc agcgtggcag gcacaccaga cccgggcgta gtgatgagct 120 
tcaccttcga cagtcaccag ctggaggagg cggcggaggc ggctcagggc cagggcctta 180 
gggccagggg cgtcccagct ttcacggata ctacattgga cgagccagtg cccgatgacc 240 
gttatcacgc catctacttt gcgatgctgc tggctggcgt gggcttcctg ctgccataca 300 
acagcttcat cacggacgtg gactacctgc atcacaagta cccagggacc tccatcgtgt 360 
ttgacatgag cctcacctac atcttggtgg cactggcagc tgtcctcctg aacaacgtcc 420 
tggtggagag actgaccctg cacaccagga tcaccgcagg ctacctctta gccttgggcc 480 
ctctcctttt tatcagcatc tgcgacgtgt ggctgcagct cttctctcgg gaccaggcct 540 
acgccatcaa cctcjgccgct gtgggcaccg tggccttcgg ctgcacagtg cagcaatcca 600 
gcttctacgg gcaccgcctg gcccagcctc caccagggac ccctcctcat gaactctgga 660 
gccctgagag gagaggggca gccccccacc ttgtcaccct cagggcttcc ccttctgtcc 720 
tcattcttag agactgcttc tcccaaacat aacgcgttag ccatgaagga gtcggagccc 780 
tgggtccgaa tggacccgcc tgcggtctgc atcagcctct gggaaaccac agcagtgatg 840 
ccagctgggc acgtcaggac ctccccacac acccacacga tgccacaggt cagggggctg 900 
tgcctgacta gggagccctc ccattgcctt cctggcccgg gatagaagag gggaggtaag 960 
tctgggggct acgaagccgg gcccccacac cctggctgaa gtcagcttga cctaggtctt 1020 
gaccctcatc cagcaaggga ctcgacagac ccaagggtcc ctggaacgta gggaggggct 1080 
gggggtcact ccagcccggg cctcccagaa caccaggccc gtgtgggtgg caccctgagg 1140 
tcaggggatc ctaagggtgt ccttccagag acggtgtttc cagggggagg accgcccccg 1200 
cttccagatc cccggccccg gctgtgactg ccctgtttca cccctgctgt gtcccatccc 1260 
ccgtctgtcc actaactgta ccgcaccggc cattaaaaga tgaaggcaga ccgctggaaa 1320 
aaaaaaaaaa aa 1332 

<210> 50 
<211> 2298 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 



ccctaaaaat gaaattgagt 
atactggaga ttatatcagt 
cacgatctct ggggattatg 
atatttcact attcagacct 
tttttggatc aataaagatg 
gactatgaca accctgagta 
tgcgatggat ctctttgttt 
tggaaccttg cattatttta 
gctaaaaaat aaagcctcga 
gaataatatt tctgtgccgc 
ttgtgccagc ttcttctgtt 
gatiacacata cgcattgcca 
tgccctgttc aacttggttt 



ataagtggaa aaagccctcc 720 
ttgeatttgt agggttacgg 780 
ttatcatgac aatttttttt 840 
acattccatg cattctgaca 900 
cagtgcctgc aagaacatcg 960 
caattgccag gaagtcttta 1020 
ctgtttgttt catttttgtt 1080 
ccagcaacca aaaaggaaag 1140 
tgactcctgg tctccatcct 1200 
aagaagatga ttatgggtat 1260 
gctttgaaga ctgcagaaca 1320 
aaattgactc ttattctaga 1380 
attgggttgg ctatctttac 1440 

1446 
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<223> Incyte ID No: 7482079CB1 
<400> 50 

atgctcaaac agagtgagag gagacggtcc tggagctaca ggccctggaa cacgacggag 60 
aatgagggca gccaacaccg caggagcatt tgctccctgg gtgcccgttc cggctcccag 120 
gccagcatcc acggctggac agagggcaac tataactact acatcgagga agacgaagac 180 
ggcgaggagg aggaccagtg gaaggacgac ctggcagaag aggaccagca ggcaggggag 240 
gtcaccaccg ccaagcccga gggccccagc gaccctccgg ccctgctgtc cacgctgaat 300 
gtgaacgtgg gtggccacag ctaccagctg gactactgcg agctggccgg cttccccaag 360 
acgcgcctag gtcgcctggc cacctccacc agccgcagcc gccagctaag cctgtgcgac 420 
gactacgagg agcagacaga cgaatacttc ttcgaccgcg acccggccgt cttccagctg 480 
gtctacaatt tctacctgtc cggggtgctg ctggtgctcg acgggctgtg tccgcgccgc 540 
ttcctggagg agctgggcta ctggggcgtg cggctcaagt acacgccacg ctgctgccgc 600 
atctgcttcg aggagcggcg cgacgagctg agcgaacggc tcaagatcca gcacgagctg 660 
cgcgcgcagg cgcaggtcga ggaggcggag gaactcttcc gcgacatgcg cttctacggc 720 
ccgcagcggc gccgcctctg gaacctcatg gagaagccat tctcctcggt ggccgccaag 780 
gccatcgggg tggcctccag caccttcgtg ctcgtctccg tggtggcgct ggcgctcaac 840 
accgtggagg agatgcagca gcactcgggg cagggcgagg gcggcccaga cctgcggccc 900 
atcctggagc acgtggagat gctgtgcatg ggcttcttca cgctcgagta cctgctgcgc 960 
ctagcctcca cgcccgacct gaggcgcttc gcgcgcagcg ccctcaacct ggtggacctg 1020 
gtggccatcc tgccgctcta ccttcagctg ctgctcgagt gcttcacggg cgagggccac 1080 
caacgcggcc agacggtggg cagcgtgggt aaggtgggtc aggtgttgcg cgtcatgcgc 1140 
ctcatgcgca tcttccgcat cctcaagctg gcgcgccact ccaccggact gcgtgccttc 1200 
ggcttcacgc tgcgccagtg ctaccagcag gtgggctgcc tgctgctctt catcgccatg 1260 
ggcatcttca ctttctctgc ggctgtctac tctgtggagc acgatgtgcc cagcaccaac 1320 
ttcactacca tcccccactc ctggtggtgg gccgcggtga gtacctttgc cctgggcttt 1380 
cccatcctct tccccagccc agtgagctgc tcctccctcc cctggttatc agccaccagg 1440 
ctttggcttc tgatcctcgt cttccccccc acccccaatc gccgcataca gctaacaaaa 1500 
cggcgatgga tgtcaaaagt ggtggaaaga gaactcagca gatcagtaaa ctccagcagc 1560, 
cacatgtcga tggctgtggc aaagaacaag agagagaatg caagccccat catgcaaaca 1620 
cttcataagt ttcttttcat ggcatttgct cagcccattg gccagagtaa gtcacatggc 1680 
caagctgcaa gtcaaagggc agggcaggtg agcatctcca ccgtgggcta cggagacatg 1740 
tacccagaga cccacctggg caggtttttt gccttcctct gcattgcttt tgggatcatt 1800 
ctcaacggga tgcccatttc catcctctac aacaagtttt ctgattacta cagcaagctg 1860 
aaggcttatg agtataccac catacgcagg gagaggggag aggtgaactt catgcagaga 1920 
gccagaaaga agatagctga gtgtttgctt ggaagcaacc cacagctcac cccaagacaa 1980 
gagaattagt attttatagg acatgtggct ggtagattcc atgaacttca aggcttcatt 2040 
gctctttttt taatcattat gattggcagc aaaaggaaat gtgaagcaga catacacaaa 2100 
ggccatttcg ttcacaaagt actgcctcta gaaatactca ttttggccca aactcagaat 2160 
gtctcatagt tgctctgtgt tgtgtgaaac atctgacctt ctcaatgacg ttgatattga 2220 
aaacctgagg ggagcaacag cttagatttt tcttgtagct tctcgtggca tctagctcaa 2280 
taaatatttt tggacttg 2298 

<210> 51 
<211> 2250 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 55145506CB1 

<400> 51 

agaaacagat ctctcggatc aataagcatg aatgacgaag actacagcac catctatgac 60 

acaatccaaa atgagaggac gtatgaggtt ccagaccagc cagaagaaaa tgaaagtccc 120 

cattatgatg atgtccatga gtacttaagg ccagaaaatg atttatatgc cactcagctg 180 
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aatacccatg agtatgattt tgtgtcagtc tataccatta agggtgaaga gaccagcttg 240 
gcctctgtcc agtcagaaga cagaggctac ctcctgcctg atgagatata ctctgaactc 300 
caggaggctc atccaggtga gccccaggag gacaggggca tctcaatgga agggttatat 360 
tcatcagccc aggaccagca actctgcgca gcagaactcc aggagaatgg gagtgtgatg 420 
aaggaagatc tgccttctcc ttcaagcttc accattcagc acagtaaggc cttctctacc 480 
accaagtatt cctgctattc tgatgctgaa ggtttggaag aaaaggaggg agctcacatg 540 
aaccctgaga tttacctctt tgtgaaggct ggaatcgatg gagaaagcat cggcaactgt 600 
cctttctctc agcgcctctt catgatcctc tggctgaaag gagtcgtgtt caatgtcacc 660 
actgtggatc tgaaaagaaa gccagctgac ctgcacaacc tagcccccgg cacgcacccg 720 
cccttcctga ccttcaacgg ggacgtgaag acagacgtca ataagatcga ggagttcctg 780 
gaggagacct tgacccctga aaagtacccc aaactggctg caaaacaccg ggaatccaac 840 
acagcgggca tcgacatctt ttccaagttt tctgcctaca tcaaaaatac caagcagcag 900 
aacaatgctg ctcttgaaag aggcctaacc aaggctctaa agaaattgga tgactacctg 960 
aacacccctc taccagagga gattgacgcc aacacttgtg gggaagacaa ggggtcccgg 1020 
cgcaagttcc tggatgggga tgagctgacc ctggctgact gcaatctgtt gcccaagctc 1080 
catgtggtca agacccacct tctcacttcc tccagcaact tcctaaggaa caagtaccac 1140 
tgaaagggat gatataattc cagctcagtc acactgtgtc agagtgatac aatgcaaaga 1200 
tcaggagacc cgagttccgg tcctgtattt gctgccaact agcagcatga gctgaggcac 1260 
atcatttaat ctttttggaa ttcatttttc tcatgcctag aagaacagaa gtggattgta 1320 
ttccttcttg ccttcttttc ctttcttctt tccctccttc tttccttttc tcttgctcaa 1380 
acatgtattc actaccactc aaaaaccatt tgttgaacaa agcaaacaaa tgaatctccc 1440 
aagccttggg cttcatcctg tgatttcctc aattcccacc tgccttaaat tactcagtga 1500 
agccctgtcc ttggagaaaa ttcagtgggt ggttaaccca gagaagctgg agatcaaaaa 1560 
gaagatggcc aatgaaagaa caaaggccag cccttggccc ctatctcttt ggatttctgc 1620 
tgatccagct tatcagatcc cagaaacctg gcaaacctct aaagttcaca aagagcgaag 1680 
gggaagccaa gtcaggcctc cagtttggct tcggatgcca aaacttaatc tgggctgtgg 1740 
gagctaactg ttttcatatg aaagagcaaa ttcagaacat gagcatggaa gtccctgcga 1800 
acgtcagatc tccgtgtgca tccttacccc cttgctgctt tcatgctcac tctcctcttg 1860 
cgtggctcgc tttcaggttt atctccatcc ctggaagcag agttgctctg gcccaggctc 1920 
tccatgagag tttggcttga acattcattg tctggccccc tcctagttct catctcccaa 1980 
agtcaagcca atgtgtgaag aaatgaccag ctcagcagcc aaggcccagg gtgcacaggt 2040 
cttcgttggg agaggcatct gcaggccttt ccttgcccac tgggatcctt gcctagcata 2100 
gtgacgatgt tcagccctgg agacaaacaa gaaggggaac accaacatca atagaagtat 2160 
atatttacaa attgcatttc tgctgtattg aaactaacat tctgcccttt aaaatcctga 2220 
aaataaaatt tcagtatgaa atgaaaaaaa 2250 

<210> 52 
<211> 3430 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5950519CB1 

<400> 52 

gagctgaccc tgcggggtcc cgggggggga gggggagccg cgaagccccc actgaggccg 60 
ccgctgccgg gcctcccctc ccccccgggc gggcgccatg cgggggagcc cgggcgacgc 120 
ggagcggcgg cagcgctggg gtcgcctgtt cgaggagctg gacagtaaca aggatggccg 180 
cgtggacgtg cacgagttgc gccaggggct ggccaggctg ggcgggggca acccagaccc 240 
cggcgcccaa cagggtatct cctctgaggg tgatgctgac ccagatggcg ggctcgacct 300 
ggaggaattt tcccgctatc tgcaggagcg ggaacagcgt ctgctgctca tgtttcacag 360 
tcttgaccgg aaccaggatg gtcacattga tgtctctgag atccaacaga gtttccgagc 420 
tctgggcatt tccatctcgc tggagcaggc tgagaaaatt ttgcacagca tggaccgaga 480 
cggcacaatg accattgact ggcaagaatg gcgcgaccac ttcctgttgc attcgctgga 540 
aaatgtggag gacgtgctgt atttctggaa gcattccacg gtcctggaca ttggcgagtg 600 
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cctgacagtg ccggacgagt tctcaaagca 
gctggtggcc ggcgcagtgg caggtgccgt 
cctcaaggtc ttcatgcagg tccatgcctc 
gcttcgaagc atggtccttg agggaggcat 
tgtactcaag attgcccccg agtcagctat 
ggccatcctg gggcagcagg agacactgca 
ggctggtgcc acagcccaaa ccatcattta 
cttgcgccgg acgggccagt ataaggggct 
ggaggggccc cgtgccttct accgcggcta 
tgcgggcatc gacctggccg tctacgagac 
ccacgactcg gcagacccag gcatcctcgt 
ctgcggccag atagccagtt acccgctggc 
ctccatcgag ggtggccccc agctgtccat 
ggagggcatg cggggcctct accgggggat 
tgtgagcatc tcctatgtgg tctacgagaa 
gtgagggacc cggagcccgt ccccccaatc 
ctgatgatcc aaccacagga tccctactct 
ggatcctaga ctcctatgcc ccaaccattg 
atcctagact cctatgcccc aaccactggg 
tcccagatcc ccctgtaacc ataactgtgg 
ccctaaccac aagattcaag attatccaca 
actggatcct gcagccccac atcctaaggt 
gatcctggac ctctacgtct taaccactgg 
caccccaacc acaggagcac ggattccctg 
ggcaccagat ccagtgtcct agtgaaacgc 
atgcctcgag ccctggatct ccaagctcag 
ccactggatc ctgacaacca caatgcctgg 
tcccctcact ccacccactg gattcctgca 
ctcgacactg ggtctcagat ccttctgctg 
gccttccacc cccaggcact ggatcccaga 
tcctaaaaca agtgcggggg ccccagtggc 
ctggattcca gattctgggt ccccaatccc 
tctcttgaag gcagatcctg atattccaag 
ttctgatctg aatcttccag gcgccgggtc 
ggcagcccag tcacagagta tcccacacac 
tgaaattgca tggtcgaggg agtctgtggg 
cacggatgct ggattccccc tcaccccacc 
gggaaactgt tccctggaac cactccactc 
atccccctcc caccttctag gcgaatagtc 
ggaatcactc actcctggag gctggcaagg 
gatgtcccca ccccagcact atcagggcct 
aaaatcggca aagaccaccc ttccctaaac 
caaaaaaaaa ccaccaaatc cccaaattca 
agcaaagtcc tcctcccctg cgcccttaca 
aggatcatct ggagacccca tggagggact 
ctggaggact tccgcacgct tccacctcct 
tgtgtgtgtt atttcaaagg aaaagaacaa 
aaaaaaaaaa 



agagaagctg acgggcatgt ggtggaaaca 660 
gtcacggaca ggcacggccc ctctggaccg 720 
aaagaccaac cggctgaaca tccttggggg 780 
ccgctccctg tggcgcggca atggtattaa 840 
caagttcatg gcctatgaac agatcaagag 900 
tgtgcaggag cgcttcgtgg ctggctccct 960 
ccctatggag gtgctgaaga cgcggctgac 1020 
gctggactgc gccaggcgta tcctggagag 1080 
cctccccaac gtgctgggca tcatccccta 1140 
tctgaagaac tggtggcttc agcagtacag 1200 
gctcctggcc tgcggtacca tatccagcac 1260 
cctggtccgg acccgcatgc aggcacaagc 1320 
gctgggtctg ctacgtcaca tcctgtccca 1380 
cgcccccaac ttcatgaagg ttattccagc 1440 
catgaagcag gccttggggg tcacgtccag 1500 
cctcaccccc cacacctcag ccactggaga 1560 
ttggccacga gatcccagta cccagatcct 1620 
ggtcatggga tcccagcacc cagatcctgg 1680 
tcatgcgatc cccacccttc agccactaga 1740 
atcccttact tcagcaactc aagtctgcta 1800 
ccccagccct taatccccat cccccaaatc 1860 
ggatcccacg cttccctgtg ccccctactg 1920 
atcccacaca aatcagtgaa tggatcccaa 1980 
tacctcaaca cccagaccct gcctccctca 2040 
tggatcctag atccccaacc ccagatcccc 2100 
ctgctggatt ctggatgtca acaaacctca 2160 
atcctggggc ccccatcact ggatcccaga 2220 
ttggtttttg gttttttgtt tttttttaac 2280 
actgccagat ccctgcattt caagcactac 2340 
ttcccaagcc ttcacccacc agattctggc 2400 
acagcaagtg gatcctggca actgcagctg 2460 
tctgcccagt ccctcaatgt tgaaacctca 2520 
gcactgaatc ccaagccctg aatccccggt 2580 
ccaaatgttc aggccccaag tctagatcct 2640 
actggtgccc agagccggct tctcatgaca 2700 
gaaggaagcc caggtcctgg ctgcaacctg 2760 
tctgcatggc caccccctcc cagccctgtg 2820 
cctgcatccc cacacttcac agcatcttcc 2880 
cccagagctg tgttcctcca aggggtccga 2940 
agacagtctg aggccaggga cacatgaagg 3000 
ccccaggctt ccagagttga aagccaggag 3060 
ccaagcaccc aatgatgcaa aaaacaaaaa 3120 
ttccagatct atttttctac cagagagagg 3180 
ttctgcactt catagttgga ttctgagctt 3240 
tggaaagggg aactgggatt tggggagggg 3300 
tcgacctcca ctgcgcccca cctccctgcc 3360 
aaggaataaa ttttctaagc tctttaaaaa 3420 
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(57) Abstract: The invention provides human transporters and ion channels (TRICH) and polynucleotides which identify and en- 
code TRICH. The invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also 
provides methods for diagnosing, treating, or preventing disorders associated with aberrant expression of TRICH. 



